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Final  Report  for  the  DOD  Grant  DAMD  17-98-1-8042 

Idea  Grant.  April  1998-April  2001 


Introduction 

The  objective  of  research  supported  by  this  Idea  Grant  was  to  attempt  the  development  of 
new  designs  of  conditional  toxins  that  are  sensitive  to  the  presence  of  more  than  one  molecular 
target,  and  to  begin  implementing  these  new  reagents  as  therapeutic  agents  against  breast  cancer. 

Body 

During  three  years  of  support  by  the  Idea  Grant  we  carried  out  several  projects,  described 

below. 

Signal-regulated,  cleavage-mediated  toxins 

During  the  last  two  years,  we  essentially  completed  the  work  on  the  first  example  of  a 
cleavage-mediated  conditional  toxin.  A  draft  of  the  paper  about  this  class  of  conditional  toxins, 
will  be  submitted  for  publication  in  the  Fall  of  2001  (a  preliminary  preprint  of  this  work  was 
submitted  with  the  year-2000  Progress  Report  (I.  Davydov  &  A.  Varshavsky.  Conditional  toxin 
against  yeast  cells  expressing  HIV  protease).  (We  were  delayed  in  finishing  this  work  by 
unexpected  difficulties  with  remaining  control  experiments.) 

Construction  of  degradation  signals  in  the  lysine-asparagine  sequence  space 

Among  the  problems  to  be  solved  on  the  way  to  multitarget  conditional  toxins  is  the 
paucity  of  strong,  portable  N-degrons  that  are  free  of  drawbacks  of  the  existing  degrons  of  this 
class.  These  drawbacks  include  the  tendency  of  proteins  bearing  such  degrons  to  undergo 
endoproteolytic  cleavages  that  sever  the  N-degron  from  the  rest  of  the  protein,  and  thereby 
contribute  to  the  high  background  of  assays  with  N-degron-bearing  proteins.  As  a  part  of  our 
ongoing  effort  to  construct  better  degradation  signals,  including  better  degrons  for  conditional 
toxins,  we  developed  a  new  approach,  which  makes  it  possible  to  identify  and  isolate  specific 
degradation  signals  in  the  sequence  space  of  just  two  amino  acids,  lysine  and  asparagine 
A  paper  describing  these  results  was  published  in  late  1999.  (T.  Suzuki  &  A.  Varshavsky. 
Degradation  signals  in  the  lysine-asparagine  sequence  space.  EMBO  J.  18:6017-6026, 1999.) 
Preprint  enclosed.! 
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Bivalent  Inhibitor  of  Protein  Degradation  by  the  N-End  Rule  Pathway 

In  a  project  relevant  to  the  theme  of  this  grant,  we  have  developed  a  specific  bivalent 
inhibitor  of  the  N-end  rule  pathway.  The  relevance  of  this  advance  stems  from  the  fact  the 
bivalent  inhibitor  (described  below)  improved  the  understanding  of  the  targeting  (substrate¬ 
binding)  sites  of  N-recognin,  the  E3  component  of  the  ubiquitin-dependent  N-end  rule  pathway, 
which  is  at  the  center  of  our  sitoxins  and  comtoxin  projects.  Eventually,  the  designs  of 
conditional  toxins  should  be  able  to  involve  any  of  the  several  ubiquitin-dependent  proteolytic 
pathways;  the  current  focus  on  the  N-end  rule  pathway  reflects  not  only  the  interest  of  this 
laboratory  but  also  the  fact  that  this  pathway  is  particularly  well  understood  mechanistically. 
Moreover,  in  staying  within  the  N-end  rule  pathway,  it  is  possible  to  use  the  portable,  highly 
active  N-degrons,  an  advantage  that  is  not  available  with  most  of  the  other  ubiquitin-dependent 
pathways.  A  paper  describing  these  results  was  published  in  1999.  (Y.  T.  Kwon  & 

A.  Varshavsky.  Bivalent  inhibitor  of  the  N-end  rule  pathway.  J.  Biol.  Chem.  274:18135-18139, 
1999.)  (reprint  enclosed.) 

Construction  and  Use  of  the  Library  Expressing  Random  Fragments  of  the 
Green  Fluorescent  Protein  (GEPl  and  Src  Protein  Kinase 

The  aim  of  this  project  was  to  develop  a  generally  applicable  method  for  interfering  with 
the  in  vivo  function  of  a  target  protein  of  interest  through  the  expression  of  a  specific  fragment  of 
the  target  that  would  bind  to  a  conformationally  immature  (nascent)  target  protein  and  thereby 
either  arrest  or  strongly  retard  its  conformational  maturation.  As  a  result,  the  complex  of  the 
target  and  its  self-peptide  would  be  nonfunctional  insofar  as  the  target’s  function  is  concerned. 

In  addition,  the  interfering  peptide  would  be  equipped  with  a  strong  N-terminal  degradation 
signal  (the  N-degron,  previously  studied  by  this  laboratory).  A  permanently  or  transiently  stable 
complex  between  a  target  protein  and  its  self-peptide  bearing  an  N-degron  would  be  targeted  for 
processive  degradation  by  the  N-end  rule  pathway,  one  of  the  pathways  of  the  ubiquitin  system. 
Once  developed,  this  method  could  be  applied  to  effect  selective  functional  inhibition  of 
undesirable  proteins  such  as,  for  example,  overexpressed  oncoproteins.  This  trans-degradation 
method  would  also  become  a  part  of  the  comtoxin  approach  described  in  the  Idea  Grant  that 
supports  these  studies.  To  develop  the  trans-degradation  technique,  we  chose  Green  Fluorescent 
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Protein  (GFP)  as  a  model  target  protein.  (Our  next,  clinically  relevant,  target  is  Src,  a  protein 
tyrosine  kinase  and  an  proto-oncoprotein  activated  in  a  number  of  human  cancers,  including 
breast  cancer.)  We  carried  out  the  construction  of  a  library  of  clones  which  expressed 
polypeptide  fragments  derived  from  random  segments  of  GFP.  These  fragments  were  expressed 
in  the  yeast  S.  cerevisiae  from  the  Pgal  promoter  as  linear  fusions  to  ubiquitin.  As  shown 
previously  by  this  laboratory,  the  N-terminal  ubiquitin  moiety  of  the  fusion  is  cotranslationally 
cleaved  off  by  ubiquitin-specific  proteases,  making  it  possible  to  expose  any  desired  amino  acid 
residue  at  the  N-terminus  of  the  resulting  polypeptide  fragment.  In  developing  this  approach,  we 
proceeded  in  steps,  at  first  constructing  a  ubiquitin  fusion-based  library  of  GFP  fragments  as 
such.  The  second-generation  library  should  have  the  same  fragments  bearing  the  40-residue 
N-terminal  extension  called  Arg-e^*^,  which  contains  the  above-described  N-degron.  The  library 
expressing  random  GFP  fragments  was  constructed  using  DNase  I  in  the  presence  of  Mn2+  ions 
to  fragment  the  GFP  cDNA,  and  specific  oligonucleotide  adapters  to  insert  the  resulting 
fragments  into  the  expression  vector.  In  2000,  we  began  the  construction  of  analogous  libraries 
with  v-Src  and  c-Src  trans-targeting,  degron-containing  reporter.  The  v-Src  and  c-Src  cDNAs 
were  obtained  from  Dr.  D.  Morgan  (Univ.  of  California,  Berkeley).  They  were  tested  by 
sequencing  the  5’-  and  3 ’-ends  (-200  bp  each)  of  the  coding  regions,  and  confirming  that  the 
resulting  sequences  matched  those  for  v-Src  and  c-Src  in  the  GenBank  database.  Restriction 
fragments  containing  the  Src  genes  were  isolated  by  agarose  gel  electrophoresis,  and  are  being 
used  to  construct  random-fragment  libraries  as  described  above.  Taking  advantage  of  the  unique 
Nco  I  site  at  the  5’ -end  the  v-Src  cDNA,  the  v-Src  coding  region  (as  a  filled-in  Nco  I-Xho  I 
fragment)  was  inserted  into  the  integration  vector  pIP123-CUPR.  This  placed  the  Src  ORF 
under  the  control  of  the  copper-inducible  Pcupi  promoter.  This  part  project  ran  into  technical 
difficulty,  primarily  because  this  promoter  was  insufficiently  tight  (not  low  enough  level  of  Src 
in  the  uninduced  state).  (In  contrast  to  the  GFP-based  library,  where  the  screen  is  being  carried 
out  by  fluorescence-activated  cell  sorting  (FACS),  the  readout  and  the  screen  with  Src  are  based 
on  the  known  toxicity  of  mammalian  Src  to  the  yeast  S.  cerevisiae.)  Examination  of  the 
three-dimensional  structures  of  Src  family  tyrosine  kinases  shows  that  a  loop  connecting  the  SH2 
domain  to  the  kinase  domain  is  bound  to  the  SH3  domain  of  Src  and  sandwiched  between  the 
SH3  domain  and  the  N-terminal  half  of  the  kinase  when  Src  is  in  the  inactive  state.  The 
kinase-inhibiting  interaction  requires  phosphorylation  of  Src  at  its  C-terminal  region,  itself  bound 
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to  the  SH2  domain.  This  area  is  a  promising  candidate  for  a  trans-targeting  N-degron.  Our  aims 
in  this  set  of  projects  were,  first,  to  optimize  the  library-based  interference  approach  described 
above,  using  GFP  as  both  the  target  and  reporter,  then  to  expand  this  approach  to  the  Src  kinase, 
and  to  link  an  N-degron  to  the  interfering  fragments  identified  in  both  the  GFP  and  Src  library 
screen.  This  multistep  strategy  should  result  in  a  generally  applicable  method  for  metabolically 
destabilizing  (and  thereby  functionally  inhibiting)  any  protein  of  interest,  including  the  targets 
that  must  be  down-regulated  for  either  arresting  or  selectively  killing  the  breast  adenocarcinoma 
cells. 

Unfortunately,  over  the  last  year,  this  ambitious  project  ran  into  technical  difficulties, 
which  remain  to  be  overcome. 

Designing  Asparagine-Lvsine  Degrons  That  Work  in  Mammalian  Cells 

We  began  to  test  the  lysine-asparagine  N-degrons  of  S.  cerevisiae  that  were  identified  in 
the  preceding  work  supported  by  this  grant  (Suzuki  &  Varshavsky.  Degradation  signals  in  the 
lysine-asparagine  sequence  space.  EMBO  J.  18:6017-6026,  1999)  (reprint  enclosed)  in 
mammalian  cells.  Our  aim  is  to  determine  whether  any  of  the  highly  active  yeast  N-degrons  of 
this  class  that  have  been  identified  in  yeast  are  comparably  powerful  in  mammals.  If  they  are, 
several  applications,  including  those  that  encompass  conditional  toxins  will  become  possible 
right  away.  If  these  degrons  are  not  as  strong  in  the  mammalian-cell  setting,  we  will  modify 
them,  using  techniques  that  have  already  proved  successful  in  yeast,  to  produce  a  set  of  degrons 
that  would  confer  on  a  mammalian  protein  reporter  an  in  vivo  half-life  of  3  minutes  or  less.  With 
degrons  of  this  quality,  one  could  design  proteolysis-based  conditional  mutants  in  mice,  and 
conditional  toxins  as  well.  This  project  is  continuing. 

Enclosed  with  this  Final  Report  are  the  recently  published  papers  by  the  lab  (1999-2001; 
three  sets),  and  three  copies  of  the  Pi’s  CV.  The  Final  Report  describes  projects  that  were 
supported,  either  partially  or  entirely,  by  the  DOD  Idea  Grant  in  1998-2001. 
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Key  Research  Accomplishments 


•  Development  of  the  cleavage-regulated  conditional  toxin. 

•  Construction  of  degradation  signals  in  the  lysine-asparagine  sequence  space. 

•  Designing  asparagine-lysine  degrons  that  work  in  mammalian  cells. 

•  Construction  and  use  of  random  fragments  of  GFP  and  Src  as  sources  of  trans-targeting 
fragments.  (This  latter  project  ran  into  serious  technical  difficulties  and  remains 
uncompleted.) 


Reportable  Outcomes 

•  Published  papers  (they  are  cited  in  the  Body  of  the  Final  Report). 

•  Paper  in  preparation  that  reports  the  development  of  cleavage-regulated  conditional  toxin. 

•  Construction  and  characterization  of  highly  active  degrons  in  the  Lys-Asn  sequence  space. 

•  Although  the  approach  that  involved  the  construction  of  protein  fragments-based  trans¬ 
targeting  library  has  not  been  successful,  thus  far,  this  idea,  supported  by  the  Idea  Grant,,  was 
a  valuable  experience  in  the  library  design,  and  will  contribute  to  our  further  effort  in  this 
area. 

No  invention  disclosures  or  patents  based  on  the  work  supported  by  this  Idea  Grant  have 
been,  or  will  be  filed. 


Conclusions 

Please  see  Key  Research  Accomplishments  above.  The  statements  there  are  the  main 
conclusions  of  work  supported  by  this  Idea  Grant. 


Appendices 

Reprints  of  the  cited  papers  by  the  lab,  other  papers  by  the  lab  (1998-2001),  and  Pi’s  CV. 
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Cohesion  between  sister  chromatids  is  established  during  DNA 
replication  and  depends  on  a  protein  complex  called  cohesin^"^.  At 
the  metaphase-anaphase  transition  in  the  yeast  Saccharomyces 
cerevisiae,  the  £5P1 -encoded  protease  separin  cleaves  SCCl,  a 
subunit  of  cohesin  with  a  relative  molecular  mass  of  63,000  (M^ 
63K)®.  The  resulting  33K  carboxy-terminal  fragment  of  SCCl 
bears  an  amino-terminal  arginine — a  destabilizing  residue  in 
the  N-end  rule^.  Here  we  show  that  the  SCCl  fragment  is  short¬ 
lived  {ti/2  ~  2  min),  being  degraded  by  the  ubiquitin/proteasome- 
dependent  N-end  rule  pathway  Overexpression  of  a  long-lived 
derivative  of  the  SCCl  fragment  is  lethal.  In  ubrlA  cells,  which 
lack  the  N-end  rule  pathway^,  we  found  a  highly  increased 
frequency  of  chromosome  loss.  The  bulk  of  increased  chromo¬ 
some  loss  in  ubrlA  cells  is  caused  by  metabolic  stabUization  of  the 
ESP  1 -produced  SCCl  fragment.  This  fragment  is  the  first  physio¬ 
logical  substrate  of  the  N-end  rule  pathway  that  is  targeted 
through  its  N-terminal  residue.  A  number  of  yeast  proteins  bear 
putative  cleavage  sites  for  the  ESPl  separin,  suggesting  other 
physiological  substrates  and  functions  of  the  N-end  rule  pathway. 

The  sister  chromatids  of  a  replicated  chromosome  are  pulled 
apart  by  the  centromere-attached  microtubules  that  emanate  from 
spindle  poles^”^.  However,  until  the  onset  of  anaphase  the  forces 
pulling  sister  chromatids  apart  are  counteracted  by  cohesion  that 
holds  sisters  together.  A  complex  termed  cohesin  is  essential  for 
sister  chromatid  cohesion  in  both  fungi  and  metazoans.  In  the  yeast 
S.  cerevisiae,  cohesin  contains  four  subunits,  SMCl,  SMC3,  SCCl 
(also  called  MCDl  and  RAD21)  and  SCC3  (see  refs  1-5).  Perturba¬ 
tions  of  chromosome  segregation  are  among  the  causes  of  human 
cancer  and  other  diseases. 

Previous  work  has  shown  that  S.  cerevisiae  SCCl ,  a  63K  subunit  of 
cohesin,  is  cleaved  at  the  metaphase-anaphase  transition,  in  a 
reaction  that  requires  the  187K  ESPl,  termed  separin^  ESPl,  a 
cysteine  protease  of  a  class  that  includes  caspases^^^,  is  activated  at  the 
onset  of  anaphase  through  ubiquitin  (Ub) -dependent  destruction 
of  its  inhibitory  ligand  PDSl  (see  refs  1-3, 7).  A  protein  substrate  of 
the  Ub  system  is  conjugated  to  Ub  through  the  action  of  enzymes 
El,  E2  and  E3,  with  the  degradation  signal  (degron)  of  the  substrate 
recognized  by  E3.  The  resulting  multiubiquitylated  substrate  is 
degraded  by  the  26S  proteasome  (see  refs  11,  12). 

The  ESPl  separin  cleaves  the  566-residue  SCCl  at  two  sites, 
180  and  268  residues  from  its  N  terminus,  the  latter  cleavage  site 
being  the  principal  one®  (Fig.  la).  Mutations  in  both  (but  not  single) 
SCCl  cleavage  sites  that  inhibit  the  ESPl -mediated  cleavage  of 
SCCl  are  lethal;  they  preclude  sister  chromatid  separation®’ The 
N-terminal  Arg  of  ESPl -produced  SCCl  fragments  is  a  destabiliz¬ 
ing  residue  in  the  N-end  rule^’^^  The  degrons  recognized  by  the  N- 
end  rule  pathway,  called  N-degrons,  consist  of  two  determinants, 
a  destabilizing  N-terminal  residue  and  an  internal  Lys  residue^^. 
UBRl,  a  225K,  RING-H2  finger-containing  E3  of  the  N-end  rule 
pathway^^  recognizes  eight  primary  destabilizing  N-terminal  resi¬ 
dues:  Arg,  Lys,  His,  Phe,  Leu,  Tyr,  Trp  and  He.  The  other  four 
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destabilizing  residues.  Asp  and  Glu  (secondary),  and  Asn  and  Gin 
(tertiary),  frnction  through  their  enzymatic  modification  and 
conjugation  to  Arg,  a  primary  destabilizing  residue^.  UBRl  has  at 
least  three  substrate-binding  sites.  The  type  1  site  is  specific  for  N- 
terminal  Arg,  Lys  and  His.  The  type  2  site  is  specific  for  N-terminal 
Phe,  Leu,  Trp,  Tyr  and  He.  The  third  site  of  UBRl  is  specific  for 
proteins  such  as  CUP9  and  GPAl,  which  bear  internal  (non-N- 
terminal)  degrons The  functions  of  the  N-end  rule  pathway 
include  the  control  of  peptide  import  in  S.  cerevisiae,  by  regulated 
degradation  of  CUP9 — a  transcriptional  repressor  of  the  PTR2- 
encoded  peptide  transporter^^.  Engineered  N-end  rule  substrates 
can  be  produced  through  the  Ub  fusion  technique,  in  which  a  Ub- 
reporter  fusion  is  cleaved,  co-translationally,  at  the  Ub-reporter 
junction  by  deubiquitylating  enzymes  (DUBs),  yielding  a  reporter 
bearing  a  predetermined  N-terminal  residue^^’^^.  Neither  type  1  nor 
type  2  physiological  N-end  rule  substrates  have  yet  been  identified 
in  5.  cerevisiae. 

To  determine  whether  the  ESPl -produced  SCCl  fragments®  were 
degraded  by  the  N-end  rule  pathway,  we  took  advantage  of  the  fact 
that  dipeptides  bearing  destabilizing  N-terminal  residues  can  spe¬ 
cifically  inhibit  the  yeast  N-end  rule  pathway  in  vivo,  through  their 
import  into  the  cell  and  interaction  with  the  type  1  or  type  2  sites  of 
UBRl  (ref.  16).  S.  cerevisiae  K6843,  in  which  the  full-length,  C- 
terminally  Mycig-epitope- tagged  SCCl  (termed  SCCl"^)  was 
expressed  from  the  Pgali-io  promoter®,  were  arrested  in  a  meta- 
phase-like  state  with  nocodazole,  followed  by  the  induction  of 
SCCl”™  with  galactose.  The  dipeptides  Ala- Arg,  Arg-Ala,  Trp-AIa 
or  Leu- Ala  were  added  30  min  later,  and  the  incubation  was 
continued  for  another  30  min,  followed  by  extraction  of  proteins 
and  anti-Myc  immunoblotting. 

In  the  absence  of  added  dipeptides,  we  could  detect  only 
uncleaved  SCCl”™  (Fig.  lb,  lane  2).  However,  in  the  presence  of 
Arg-Ala,  bearing  a  type  1  destabilizing  N-terminal  residue,  an 
additional,  smaller  species  was  observed,  whose  size  was  consistent 
with  the  ESPl -produced  major  C-terminal  fragment  of  SCCl”™, 
termed  scci269-566m  ^35  absent  from 

cells  treated  either  with  Ala-Arg,  which  bore  a  stabilizing  N- 
terminal  residue,  or  with  Trp-Ala  or  Leu- Ala,  both  of  which  bore 
a  type  2  destabilizing  N-terminal  residue  (Fig.  lb,  lanes  1-6). 
Crucially,  scci269-566m  observed  in  the  absence  of 

dipeptides,  provided  that  the  cells  lacked  UBRl  and  therefore 
could  not  degrade  N-end  rule  substrates  (Fig.  lb,  lanes  4,  7).  No 
SCCl  fragment  derived  from  cleavage  solely  at  the  minor  site 
(residues  180-181)  was  observed  (Fig,  lb).  We  conclude,  in  agree¬ 
ment  with  an  earlier  inference®,  that  the  ESPl -mediated minor-site 
cleavage  of  SCCl  is  either  negligible  or  rapidly  followed  by  cleavage 
at  the  major  site. 

To  determine  the  in  vivo  half-life  of  Arg-SCCl^^^“^^^  it  was 
expressed  as  part  of  a  fusion  of  the  form  ^DHFR-Ub-Arg- 
SCC1^^^“^^^^,  where  T  denotes  the  N-terminal  and  C-terminal 
Flag  epitopes  linked,  respectively,  to  mouse  dihydrofolate  reductase 
(DHFR)  and  Arg-SCCl^^^'^^^  (Fig.  la).  DUBs  co-translationally 
cleave  this  fusion  at  the  C  terminus  of  the  Ub  moiety,  yielding  the 
test  protein  Arg-SCCl^^^”^^^^  and  the  long-lived  t)HFR-Ub  refer¬ 
ence  protein,  which  serves  as  an  internal  controP^’^^’^®.  In  agreement 
with  dipeptide  inhibition  experiments  (Fig.  lb),  Arg-SCCl^^^”^^^^ 
was  short-lived  in  UBRl  cells  (fi/2~2min)  (Fig.  Ic,  lanes  1-4). 
By  contrast,  the  same  Arg-SCCl^^^"^^^^  protein  was  long-lived 
{ti/2>2h)  in  ubrlA  cells,  indicating  that  its  degradation  in  wild- 
type  cells  was  mediated  bv  the  N-end  rule  pathway  (Fig.  Ic,  lanes  5- 
8;  and  le).  Arg-SCCl^^^"^^^^  was  destroyed  particularly  rapidly 
either  during  or  shortly  after  its  synthesis,  that  is,  during  the 
pulse.  Specifically,  nearly  90%  of  Arg-SCCl^^^~^^^^  was  degraded 
by  the  N-end  rule  pathway  during  the  3-min  pulse;  the  rest  of  pulse- 
labelled  Arg-SCCl^^^"^^^^  was  destroyed  more  slowly  {ty2  ^  2  min) 
(Fig.  le).  If  the  N-terminal  Arg  of  Arg-SCCl^^^"^^^^was  converted  to 
Met,  a  stabilizing  residue  in  the  N-end  rule,  the  resulting  Met- 
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(Fig.  la)  was  long-lived  in  both  UBRl  and  uhrlA  cells, 
in  contrast  to  Arg-SCCl^^^"^^^^  (Fig.  Ic,  d).  We  conclude  that 
Arg-SCCl^^^"^^^^is  targeted  for  degradation  through  its  Arg-bearing 
N-degron,  rather  than  through  an  internal  (non-N- terminal) 
degradation  signal. 

Overexpression  of  the  N- terminal  SCCl  fragment  (residues  1- 
268;  Fig.  la)  in  either  UBRl  or  ubrlA  cells  did  not  significantly  alter 
cell  growth  (data  not  shown).  In  contrast,  the  analogous  expression 
of  Arg-SCCl^^^“^^^^was  lethal  in  ubrlA  cells  (Fig.  2a).  In  UBRl  cells, 
where  Arg-SCCl^^^“^^^^was  short-lived  (Fig.  2a),  its  overexpression 
was  not  lethal  but  slowed  growth  (data  not  shown).  Overexpression 
of  the  Met-SCCl^^^"^^^^  was  lethal  in  both  UBRl  and  ubrlA  cells, 
consistent  with  metabolic  stability  of  this  SCCl  derivative  in  both 
genetic  backgrounds  (Fig.  2a). 

To  examine  toxicity  of  the  long-lived  Met-SCCl^^^“^^^^  in  more 
detail,  we  transiently  overexpressed  Met-SCCl^^^~^^^^from  the  Pgai] 
promoter  in  cells  that  were  arrested  either  before  the  S  phase  (with 
a-factor),  or  in  the  S  phase  (with  hydroxyurea),  or  in  a  metaphase¬ 
like  state  (with  nocodazole).  Transient  overexpression  of  Met- 


SCC1^^^“^^^^  was  largely  nontoxic  to  cells  in  G1  and  S  phases 
but  highly  toxic  to  mitotic  (nocodazole- arrested)  cells  (Fig.  2b), 
indicating  that  elevated  levels  of  SCCl^^^”^^^  may  interfere  with  a 
process  that  takes  place  specifically  at  metaphase-anaphase.  Con¬ 
trol  immunoblots  confirmed  that  the  levels  of  Met-SCCl^^^“^^^^ 
were  about  equal  in  cells  arrested  at  either  of  the  three  phases  of  the 
cell  cycle  (data  not  shown).  Furthermore,  we  found  that  cells 
remained  stably  arrested  in  the  presence  of  nocodazole  even  after 
overexpression  of  the  SCCl  fragment  (data  not  shown),  thus 
making  it  unlikely  that  abnormally  high  levels  of  this  fragment 
abrogated  the  MAD2 -dependent  checkpoint  and  the  ceU-cycle 
arrest  induced  by  nocodazole.  It  is  also  possible  that  selective 
toxicity  of  the  SCCl  fragment  in  nocodazole- arrested  cells  is 
caused  by  this  fragment  acting  as  a  competitive  inhibitor  of  the 
ESPl  protease. 

Full-length  SCCl  interacts  with  SMCl,  a  DNA-binding  subunit 
of  cohesin^’^.  Therefore  one  cause  of  lethality  of  overexpressed 
SCC1^^^“^^^  may  be  a  residual  (and  functionally  perturbing)  affinity 
of  this  fragment  for  the  rest  of  the  cohesin  complex.  In  this  model. 
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Figure  1  The  SCC1  fragment  is  a  substrate  of  the  N-end  rule  pathway,  a,  The  in  vivo 
cleavage  sites  in  S.  cerevisiae  S0C^  (ref.  8),  and  the  UPR-type  fusions  that  produce,  on 
co-translational  cleavage  by  DUBs,  the  Flag -tagged  reference  protein  ^DHFR-Ub  and  the 
Flag-tagged  test  proteins  Arg-SCC1^®®“^®®^  or  Met-SCCI^®®"^®®^.  b,  Detection  of  SCCl 
fragment  Control  S.  cerevisiae  ceWs  and  cells  expressing  SCCI"^  were  incubated  with 
specific  dipeptides,  followed  by  immunoblotting  with  anti-Myc  antibody.  Lane  1 ,  control 
cells  (lacking  SCCl  Lane  2,  cells  expressing  SCCl  in  the  absence  of  added  dipeptide. 
Lanes  3-6,  as  lane  2  but  in  the  presence  of  Ala-Arg  (AR),  Arg-Ala  (RA),  Trp-Ala  (WA)  and 
Leu-Ala  (LA),  respectively.  Lane  7,  ubr1A  cells  expressing  SCCr  in  the  absence  of 
dipeptides.  Arrows  indicate  bands  of  SCCl and  SCCI^®^"®®^^.  c,  Pulse-chase  analysis 
of  Arg-SCCI^^^-^^®^.  Cells  expressing  ^DHFR-Ub-Arg-SCCI^®®"^®^^  were  labelled  for 
3  min  with  p^S]methionine,  followed  by  a  chase  for  5, 10  or  20  min,  and  SDS-PAGE 
analysis  of  anti-Flag  immunoprecipitates.  Lanes  1-4,  pulse  chase  in  UBR1  cells.  Lanes 
5-8,  pulse  chase  in  ubr1A  cells.  Arrows  indicate  bands  of  Arg-SCC1^®®“^®®^  and  the 


reference  protein  ^DHFR-Ub.  d,  As  in  c,  but  with  cells  expressing  Met-SCCI^^^"^®®^, 
derived  from  ^DHFR-Ub-Met-SCC1^®^"^®®l  e,  Quantification  of  results  in  c  and  d  by 
Phosphorlmager.  Squares,  Arg-SCC1^®®"^®®^  in  JD52  (UBR1)  cells;  diamonds,  Arg- 
SCCl269-566f  in  JD55  ^ubriA)  cells;  circles,  Met-SCC1  jn  jd52  (UBRTj  cells; 
triangles,  Met-SCCI  jp  [uijriA)  cells.  The  amounts  of  in  the  test  proteins, 
relative  to  in  the  ^DHFR-Ub  reference  protein,  were  plotted  as  percentages  of  this  ratio 
for  Met-SCCI  jp  ^gUg  ^j^pg  2ero.  f,  Immunoblot  analysis,  using  anti-Myc 
antibody,  of  Arg-SCCI^^^-seem  produced  through  ESPl -mediated  cleavage  of  normally 
expressed,  C-terminally  Myc-tagged  SCCr  in  wild-type  (UBR1)  and  ubr1A  cultures 
synchronized  with  a-factor,  as  described®.  The  strains  used  were  S.  cerevisiaey92 
(MATa  ScdMydB)^  and  Y238  {MATa  ScdMydB  ubr1A).  Shown  are  the  time  after 
removal  of  a-factor  (top)  and  the  content  of  budded  (post-G1  phase)  cells  (bottom)  in 
UBR1  and  congenic  ubr1A  cultures. 
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the  function  of  metabolic  instability  of  the  fragment  is 

to  ensure  its  removal  from  the  cohesin  complex,  allowing  re¬ 
binding  of  intact  SCCl  in  the  next  cell  cycle.  To  determine  whether 
Met-SCCl^^^"^^^^  could,  in  fact,  interact  with  SMCl,  extracts  from 
cells  expressing  Flag- tagged  Met-SCCl^^^"^^^^,  either  alone  or 
together  with  RGS/Hisg-tagged  SMCl,  were  precipitated  with 
anti-Flag  antibody,  followed  by  SDS  polyacrylamide  gel  electro¬ 
phoresis  (SDS-PAGE)  of  the  immunoprecipitates  and  immuno- 
blotting  with  either  anti-Flag  or  anti-RGS/Hise  antibodies.  We 
found  that  Met-SCCl^^^'^^^^  was  specifically  co-immunoprecipi- 
tated  with  the  SMCl  (Fig.  2c).  However,  overexpression  of  SMCl, 
from  the  'Pgali  promoter  and  low  copy  plasmid,  did  not  suppress 
the  lethal  effect  of  Met-SCCl^^^~^^^^  overexpression  in  UBRl  cells 
(data  not  shown).  Thus,  the  toxicity  of  overexpressed  SCC1^^^“^^^ 
may  not  be  caused  exclusively  by  sequestration  of  cohesin  com¬ 
plexes. 

Given  the  above  results  (Figs  1  and  2),  might  ubrlA  cells,  which 
lack  the  N-end  rule  pathway  and  in  which  the  Arg-SCCl  fi’agment  is 
long-lived,  be  impaired  in  the  maintenance  of  chromosome  stabil¬ 
ity?  We  used  a  colony  sectoring  assay^^  to  compare  the  rates  of 
chromosome  loss  in  ubrlA  and  UBRl  cells  (Fig.  3a).  Remarkably, 
whereas  less  than  0.5%  of  UBRl  colonies  contained  red  sectors, 
--22%  of  the  ubrlA  colonies  contained  such  sectors  (Fig.  3a;  and 
data  not  shown),  revealing  a  previously  unknown  phenotype  of 
chromosome  instability  in  cells  lacking  the  N-end  rule  pathway.  The 
frequency  of  distinct  red  sectors  (Fig.  3a)  is  roughly  proportional  to 
the  frequency  of  cells  that  suffered  the  loss  of  a  (non-essential) 
marked  chromosome  during  the  growth  of  colonies.  We  conclude 
that  ubrlA  cells  lose  chromosomes  about  100  times  more  frequently 
than  their  congenic  UBRl  counterparts. 

To  determine  how  much  of  the  chromosome-loss  phenotype  of 
ubrlA  cells  was  caused  by  their  inability  to  degrade  specifically  the 


Arg-SCCl  firagment  (as  distinguished  fi*om  other  N-end  rule  sub¬ 
strates),  we  constructed  a  S.  cerevisiae  strain  (in  the  background  of  a 
sectoring  strain^^)  in  which  wild- type  SCCl  was  replaced  by  an  allele 
(expressed  firom  the  native  Pscci  promoter)  that  encoded  SCCl 
bearing  at  its  C  terminus  three  tandem  haemagglutinin  A  (HA)  tags 
and  lacking  the  minor  ESPl  cleavage  site  (Fig.  la),  owing  to  an  Arg 
to  Glu  mutation  at  position  180.  In  this  strain,  the  only  ESPl- 
produced  SCCl  fragment  was  Arg-SCCl We  also  con¬ 
structed  a  derivative  of  this  strain  in  which  Arg-269  of  SCCl -HA 
was  replaced  with  Gly.  The  resulting  ESPl -produced  fragment  of 
SCCl  was  the  long-lived  Gly-SCCl^^^“^^^^  bearing  a  stabilizing  N- 
terminal  residue. 

We  found  that  the  efficiency  of  cleavage  at  the  thus  altered 
cleavage  site  of  SCCl  was  indistinguishable  from  that  at  the  wild- 
type  cleavage  site  (Fig.  3b).  In  the  strain  producing  the  wild-type 
(short-lived)  Arg-SCCl fragment,  the  frequency  of  chromo¬ 
some  loss  was  indistinguishable  from  that  in  the  congenic  wild-type 
strain  (Fig.  3a;  and  data  not  shown).  By  contrast,  in  the  strain  in 
which  the  ESPl -produced  Gly-SCCl^^^“^^^^  fragment  bore  a  stabi¬ 
lizing  N-terminal  residue,  a  greatly  increased  frequency  of  chromo¬ 
some  loss  was  observed,  similarly  to  congenic  ubrlA  cells  expressing 
wild- type  SCCl  (Fig.  3a;  and  data  not  shown).  We  conclude  that  the 
bulk  of  increased  chromosome  loss  in  ubrlA  cells  is  caused  by 
metabolic  stabilization  of  the  ESPl -produced  SCCl  firagment. 

Although  the  steady-state  level  of  ESPl -generated  Arg-SCCl 
was  significantly  (~3-fold)  lower  in  wild-type  (UBRl)  than  in 
ubrlA  cells  (Fig.  3b,  lanes  2  and  3),  the  observed  difference  was  less 
than  would  be  expected  fi-om  the  extremely  short  (~2  min)  half-life 
of  overproduced  Arg-SCCl in  UBRl  cells  (Fig.  Ic,  d).  To 
assess  the  influence  of  the  N-end  rule  pathway  on  the  concentration 
of  endogenous,  ESPl -produced  Arg-SCCl fragment,  we 
monitored  its  levels  by  immunoblotting  as  a  function  of  time, 


®  Flag  IP 

RGS-HiSeSMC1  “T  I  ^ 

^Met-SCC1 269-566  _ 

RGS-His0 


Figure  2  Effects  of  overexpressing  SCC1  and  its  fragments  in  UBR1  and  ubr1A  cells, 
a,  Overexpression  of  iong-lived  SCC1  is  lethal.  Strains  K6843  (UBR1),  JD52  {UBR1) 
or  JD55  {ubr1A)  carrying  plasmids  that  overexpressed,  from  the  Pqali  promoter,  full- 
length  SCCl,  Arg-SCC1^®®"^®®  or  Met-SCC1^®^"^®®^  (left)  were  streaked  either  on 
dextrose-containing  (SD)  plates  (not  shown),  on  which  all  strains  grew,  or  on  galactose- 
containing  (SG)  piates  (right).  The  plates  were  incubated  for  3  d  at  30  °C,  b,  Plating 
efficiency  of  UBR1  cells  that  transiently  expressed  Met-SCC1^®®"^®®  while  being 
transiently  arrested  in  late  G1  phase  (with  a-factor,  aF),  in  S  phase  (with  hydroxyurea, 
HU),  or  in  a  metaphase-like  state  (with  nocodazole,  Nz).  For  each  set  of  assays:  the  first  bar 


refers  to  wild-type  [UBR1)  AVY1 1 0  cells  incubated  in  raffi nose-containing  medium  (YP-R); 
the  second  to  the  same  strain  in  galactose-containing  YP  medium  (YP-G);  the  third  to 
congenic  AVY1 1 1  cells  in  YP-R  that  carried  the  ORF  encoding  Met-SCC1^®^“^^^^ 
downstream  from  the  Pg^^  promoter;  and  the  fourth  to  the  same  strain  in  YP-G.  c,  The 
SCCl  269-566  fragment  interacts  with  SMC1 ,  Extracts  of  S.  cerevisiae  expressing  either 
Met-SCC1^®®"^®®^  (Flag-tagged),  or  RGS/HiSe-tagged  SMC1,  or  both  were  precipitated 
with  anti-Flag  antibody,  followed  by  SDS-PAGE  and  immunoblotting  with  either  anti-Flag 
or  anti-RGS/HiS6.  Asterisk  denotes  a  protein  crossreacting  with  anti-Flag  antibody. 
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using  synchronized  cultures  of  UBRl  and  ubrlA  cells.  The  levels  of 
full-length  SCCl  were  regulated  similarly  in  UBRl  and  ubrlA  cells 
(Fig.  If).  In  a  UBRl  culture,  cells  in  G1  phase  contained  virtually  no 
Arg-SCCl^^^“^^^  fragment,  which  appeared  transiently  as  cells 
entered  anaphase  (Fig.  If).  In  striking  contrast,  ubrlA  cells  con¬ 
tained  the  Arg-SCCl^^^“^^^  fragment  throughout  the  cell  cycle,  and 
in  addition  at  levels  that  significantly  exceeded  the  peak  level  of  Arg- 

SCC1269-566 

Together,  the  short-half  life  of  the  Arg-SCCl^^^“^^^  fragment 
(Fig.  le),  the  toxicity  of  its  long-lived  counterpart  (Fig.  2),  the 
altered  accumulation  pattern  and  steady-state  levels  of  Arg- 
in  ubrlA  cells  (Fig.  If),  the  greatly  increased  chromo¬ 
some  loss  in  ubrlA  cells  (Fig.  3  a),  and  the  evidence  that  most  of  this 
loss  is  caused  by  metabolic  stabilization  of  Arg-SCCl^^^“^^^  allow 
the  following  conclusions.  First,  if  the  Arg-SCCl^^^~^^^  fragment 
becomes  long-lived  (as  it  does  in  ubrlA  cells),  even  the  relatively 
low,  wild-type  level  of  its  expression  is  detrimental  to  the  fidelity  of 
chromosome  maintenance.  Second,  the  metabolic  stabilization  of 
N-end  rule  substrates  other  than  Arg-SCCl^^^“^^^  seems  not  to 
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Figure  3  Increased  chromosome  instability  in  ubr1A  S.  cerevisiae.  a,  Sectoring  assay  for 
the  chromosome  loss  using  the  St/P/ 7-marked  YPH277  (UBR1)  strain  and  its  ubr1A 
derivative  (see  Methods),  b,  Immunoblot  analysis  of  Arg-SCCI and 
Giy-SCC1^®^"^®^^  produced  through  ESP1 -mediated  cleavage  of  normally  expressed, 
C-termina!ly  HA-tagged  SCCl  In  wild-type  and  ubr1A  strains.  Yeast  cuitures  (Tleoo  ^  0.4) 
were  processed  for  immunoblotting  with  anti-HA  antibody  (Covance,  Berkeiey,  CA). 

Lane  1 ,  YPH277  (control)  cells  lacking  the  HA-tagged  SCC1 .  Lane  2,  AVY1 29  {UBR1)  ceils 
expressing  normal  levels  of  SCCf^®°^-HA.  Lane  3,  as  lane  2  but  with  AVY132  (ubr1A) 
cells.  Lane  4,  as  lane  2,  but  with  SCC1  bearing  the  Arg  to  Giy  mutation  at 

position  269.  The  bands  of  full-length  SCC1  -HA  and  its  ESPl  -produced  SCC1 
fragments  are  indicated  on  the  ieft,  c,  S.  cerevisiae  proteins  containing  putative  cleavage 
sites  for  the  ESP1  separin.  The  query  SxExGRx  and  the  Pattern  Match  program 
(http://genome-www2.stanford.edu/cgi-bin/SGD/PATMATCH/nph-patmatch)  were  used 
to  search  the  S.  cerevisiae  genome  database.  Arrows  indicate  the  inferred  sites  of 
cleavage  by  ESP1 . 


contribute  significantly  to  the  elevated  chromosome  loss  in  ubrlA 
cells.  As  to  the  cause  of  toxicity  of  overexpressed 
fragment,  one  possibility,  suggested  by  high  levels  in 

G1 -phase  ubrlA  cells  (Fig.  If),  is  that  the  fragment  interferes  with 
re-formation  of  cohesin  complexes  that  are  required  for  the  estab¬ 
lishment  of  cohesion  during  the  S  phase. 

The  increased  probability  of  chromosome  loss  in  ubrlA  cells, 
although  readily  detectable  through  the  sectoring  assay  (Fig.  3a),  is 
stiU  below  levels  that  would  cause  massive  cell  death  during  growth 
of  a  culture.  In  particular,  no  significant  differences  were  observed 
between  congenic  ubrlA  and  UBRl  cells  in  the  kinetics  of  cell-cycle 
progression  (data  not  shown).  However,  a  further  significant 
increase  in  the  level  of  long-lived  (achieved  through 

its  transcriptional  overexpression)  was  found  to  be  lethal  (Fig.  2a). 
A  parsimonious  interpretation  is  that  the  latter  effect  is  mechan¬ 
istically  similar  to  the  much  less  severe,  viability-compatible  chro¬ 
mosome-loss  phenotype  of  ubrlA  cells,  in  which 
becomes  long-lived  but  is  not  overexpressed  otherwise.  In  this 
interpretation,  the  frequency  of  chromosome  loss  in  cells  that 
overexpress  long-lived  becomes  high  enough  for  them 

to  lose  a  chromosome  nearly  every  division.  Light-microscopic 
observations  indicated  that  individual  cells  divided  once  or  twice 
after  overexpression  of  Met-SCCl^^^“^^^,  followed  by  growth  arrest 
and  a  varying  terminal  phenotype  (data  not  shown). 

Some  physiological  N-end  rule  substrates  other  than  SCCl  may 
also  be  produced  by  the  ESPl  separin.  The  consensus  sequence  of 
the  two  ESPl  cleavage  sites  in  SCCl  is  SxExGR^R  (Fig.  la). 
S.  cerevisiae  REC8,  the  meiosis-specific  counterpart  of  SCCl,  con¬ 
tains  two  near-matches  to  this  consensus®.  REC8  is  cleaved  by  ESPl 
during  yeast  meiosis,  similarly  to  SCCl  in  vegetative  cells^'^,  and 
potential  cleavage  sites  are  also  present  in  Schizosaccharomyces 
pombe  RAD21  and  REC8,  the  fission  yeast  homologues  of  SCCl 
and  REC8  (ref.  8).  Simultaneous  (but  not  single)  alterations  of  these 
sites  in  S.  pombe  RAD21  block  chromosome  segregation,  similarly 
to  the  results  in  S.  cerevisiae^K  A  shorter  consensus,  SxExGR  , 
derived  from  comparisons  of  the  ESPl  cleavage  sites  in  the  budding 
and  fission  yeast  SCCl  homologues®,  was  used  to  search  the  S. 
cerevisiae  genome  database,  yielding  29  putative  substrates  of  ESPl 
other  than  SCCl  and  REC8  (Fig.  3c).  The  N-terminal  residues  of  the 
predicted  C-terminal  fragments  of  these  proteins  encompass  nearly 
the  entire  set  of  12  residues  that  are  destabilizing  in  the  N-end  rule, 
including  tertiary  and  secondary  destabilizing  residues  (Fig.  3c). 

Until  now,  the  known  substrates  of  the  N-end  rule  pathway  that 
are  recognized  through  an  N-degron  were  either  polyprotein- 
derived  viral  proteins^’^^  or  proteins  secreted  by  an  intracytoplasmic 
bacterium  such  as  Listeria  monocytogenes  into  the  mammalian  cell  s 
cytosoF^.  Processing  proteases  that  convert  pro-N-degrons  into  N- 
degrons  have  long  been  envisioned  as  regulated  entry  points  to  the 
N-end  rule  pathway^’^^.  The  identification  of  ESPl  as  one  such  entry 
point,  and  of  SCCl  as  the  first  N-degron-based  physiological  N-end 
rule  substrate  in  yeast,  marks  the  initial  understanding  of  connec¬ 
tions  between  the  N-end  rule  and  the  ESPl  separin.  The  discovery  of 
increased  chromosome  instability  in  ubrlA  cells  suggests  that 
chromosome  aneuploidy  and  related  mutator  phenotypes  charac¬ 
teristic  of  mammalian  cancer  cells  may  also  involve  perturbations  of 
the  N-end  rule  pathway.  Because  ESPl  homologues  are  present  in 
most  if  not  all  eukaryotes^^,  identifying  ESPl  substrates  in  these 
organisms  should  be  an  effective  way  to  pinpoint  physiological 
N-end  rule  substrates,  as  well  as  additional,  currently  unknown, 
functions  of  the  N-end  rule  pathway.  □ 

Methods 

Yeast  strains  and  plasmids 

Strains  S.  cerevisiae  JD52  (MATa  \ys2-801  ura3-S2  trpl-A63  his3-A200  Ieu2-3yll2)  and 
JD55  {ubrlr.HISS  in  the  JD52  background)  have  been  described^^  S.  cerevisiae  K6843 
{MATa  harl  ade2-l  trpl-1  canl-100  leu2-3412  hisS-llJS  Gal-ScclMycl8::URA3) 
expressed  full-length,  C-terminally  Mycig-tagged  SCCl  from  the  Pgalmo  promoter*.  We 
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produced  AVY115  {ubrl::LEU2)y  a  UBRl-lacking  derivative  of  K6843,  using  standard 
techniques.  The  plasmids  p426CUPR-SCCl,  p426CUPR-SCClR,  p426CUPR-SCClM 
and  p426CUPR-SCClN  expressed,  respectively,  full-length  SCCl^,  Arg-SCCl^^^“^^®^,  Met- 
SCCl^^’"^^®^and  SCCl^"^^®^from  the  Pcupi  promoter  as  UPR^^  fusions  of  the  form 
^DHFR-Ub-(SCC1  derivative^  (Fig.  la).  The  Ub  moiety  of  these  fusions  bore  a  Lys  to  Arg 
alteration  at  position  48. 

^DHFR-Ub-Arg-SCCl^*^^''^^^^and  ^DHFR-Ub-Met-SCCl^^^”^^^^were  expressed  from 
the  Pgali  promoter  in  the  plasmids  p416UPRSCClR  and  p416UPRSCClM,  respectively, 
which  were  derived  from  the  low-copy  vector  p416Gall  (ref.  26).  The  N-terminally  RGS/ 
HiSfi-tagged  SMCl  was  expressed  from  the  Pgali  promoter  in  plasmid  p415GalSMCl, 
derived  from  p415Gall  (ref.  26). 

We  cloned  a  fragment  encoding  Met-SCCl^^^“^^^^  downstream  from  the  Pqali  promoter 
into  pRS305  (ref.  25).  The  resulting  plasmid  was  cut  with  Xcml  within  LEU2  and 
transformed  into  S.  cerevis/ae  AVYllO  {MATa  barl-1  ade2-l  Ieu2-3yll2  ura3-l),  yielding 
AVYlll  {leu2::PGALrM^t-SCCl^^^~^^::LEU2).  UBRl  was  deleted  from  the  sectoring- assay 
strain  YPH277  (MATa  ura3-52  ade2-101  trpl-Al  Ieu2-Al  CFVII  (RAD2.d)  URA3 
SUPliy^y  yielding  YPH277HR1  (ubrl::LEU2).  We  also  constructed,  in  the  background  of 
the  pYIplacl28  vector,  the  alleles  of  SCCl  encoding  the  triple-HA  tag  at  the  SCCl  C 
terminus’®  and  either  the  Arg  to  Glu  mutation  at  position  180  or,  in  addition,  the  Arg  to 
Gly  mutation  at  position  269.  The  resulting  plasmids  were  cut  with  BcoRV  within  LEU2 
and  transformed  into  the  diploid  sectoring-assay  strain  AVyi25  (MATa/MATa  SCCl/ 
sccl::Kan^  ura3-52/ura3-52  ade2-101/ade2-101  TRPl/trpl-Al  Ieu2-Al/  Ieu2-Al  CFVII 
(RAD2.d)  URA3  SUPllY^  lacking  one  of  two  copies  of  SCCl.  Haploid  derivatives  of  the 
resulting  transformants,  verified  for  the  presence  of  the  intended  SCCl  alleles,  were  the 
strains  AVY129  (MATa  sccl::Kan^  leu2::SCCl^^^^^::LEU2  ura3-52  ade2-101  trpl-Al  CFVII 
{RAD2.d)  URA3SUP11)  and  AVY130  (MATasccl::Kan^leu2::SCCl^'^‘^^^^^^::LEU2ura3- 
52ade2-101  trpl-Al  CFVII  (RADZd)  URA3SUP11). 

Experiments  with  dipeptides 

We  grew S.  cerevisiae  K6843  (UBRTf  and  AVYl  15  (ubrlA)  at  30  “^C  in  YP  (yeast-peptone) 
medium  containing  2%  raffinose  as  the  carbon  source  to  an  absorbance  at  600  nm 
(Aeoo)  0,5.  Cells  were  arrested  at  metaphase  by  nocodazole  (Sigma),  at  15  |ULgm^^  The 
Pgali-10  promoter  was  induced  by  the  addition  of  galactose  to  2%  for  30  min.  We  added 
specific  dipeptides  (Sigma)  to  5  mM,  and  incubated  2-nil  cultures  for  another  30  min, 
followed  by  the  addition  of  an  equal  volume  of  0.3  M  NaCl,  0.1  M  NaF,  20  mM  EDTA, 

2  mM  NaNa  (pH  8.2).  Cells  were  pelleted  by  centrifugation,  resuspended  in  0.2  ml  of  the 
lysis  buffer  (10%  glycerol,  50  mM  NaOH,  2%  SDS,  5%  p-mercaptoethanol),  and  heated  at 
100°C  for  5  min.  The  suspension  was  titrated  with  1 M  HCl  to  pH  7,  and  centrifuged  at 
12,000^  for  2  min.  Samples  of  the  supernatants  ( 10  p.1)  were  subjected  to  SDS-PAGE  (10% 
gel),  followed  by  immunoblotting  with  a  1:1000  dilution  of  anti-Myc  9E10  antibody 
(Berkeley  Antibody  Co.)  The  blot  was  incubated  with  a  1:2000  dilution  of  the  goat  anti- 
mouse  horseradish  peroxidase  (HRP)  conjugate,  and  was  developed  using  ECL  reagents 
(Amersham). 

Pulse-chase  analysis 

S.  cerevisiae  JD52  and  JD55,  carrying  plasmids  that  expressed  ^DHFR-Ub-Arg- 
Scci269-566f  ^DHPR-Ub-Met-SCCl^^®”^^^^  from  the  Pgali  promoter,  were  grown  at 
30  °C  to  Aeoo  ^  1  in  SR(-ura)  medium  with  auxotrophic  supplements  and  2%  raffinose  as 
the  carbon  source.  Pgali  was  induced  by  adding  galactose,  to  2%  (SRG(-ura)  medium), 
for  1  h.  Cells  from  a  20 -ml  culture  were  gathered  by  centrifugation,  washed  with  0.8  ml 
SRG(-ura),  resuspended  in  0.4  ml  of  SRG(-ura),  and  labelled  for  3  min  with  0.16  mCi  of 
^^S-Express  (NEN),  followed  by  centrifugation  and  resuspension  of  cells  in  SD(-ura) 
containing  4  mM  L-methionine  and  2  mM  L-cysteine.  We  took  0.1  ml  samples  at  the 
indicated  time  points  and  processed  them  for  immunoprecipitation  with  anti-Flag  M2 
agarose  beads  (Sigma),  followed  by  SDS-PAGE,  as  described^^ 

Experiments  with  growth  inhibitors 

We  grew  S.  cerevisiae  AVYl  1 1,  which  expresses  Met-SCCl^®’“^^^^  from  the  Pgali  promoter, 
and  the  congenic  parental  strain  AVYl  10  at  30  ®C  in  YP  medium  containing  2%  raffinose 
as  the  carbon  source  to  A^oo  ~  0.5.  To  separate  samples  of  the  two  cultures,  were  added  a- 
factor  (to  1  ixgml"^),  hydroxyurea  (to  0.1  M)  or  nocodazole  (to  15  |jLgmr^),  followed  by 
incubation  for  2.5  h.  One  half  of  each  of  the  resulting  samples  was  then  incubated  further 
in  the  same  medium  for  1 .5  h;  the  other  half  had  galactose  added  to  2%,  followed  by  a  1 .5-h 
incubation.  Cells  were  gathered  by  centrifugation,  washed  with  water,  and  plated  onto 
dextrose-containing  synthetic-media  (SD)  plates. 

Co-immunoprecipitation/immunoblotting  assay 

JD52  cells  carrying  two  vectors  (pRS415  and  pRS416^^),  or  pRS415  and  p416UPRSCClM 
(expressing  Flag-tagged  Met-SCCl^^^"^*^®^),  or  pRS416  and  p415GalSMCl  (expressing 
RGS/Hisg-tagged  SCMl),  or  p415GalSMCl  and  p416UPRSCClM  together,  were  grown  in 
the  raffinose-containing  SR  medium  to  A^oo  1-  Galactose  was  added  to  2%,  followed  by 
incubation  for  2  h,  preparation  of  extracts^^  immunoprecipitation  with  anti-Flag  M2 
beads,  SDS-PAGE  (10%  gel),  and  immunoblotting,  separately,  with  anti-Flag  and  anti- 
RGS/His6  antibodies  (Qiagen). 


Sectoring  assay 

S.  cerevisiae  YPH277  contained  the  defective  ade2-101  ochre  allele  of  ADE2,  which  was 
suppressed  by  the  SUP  11  suppressor  transfer  RNA  gene,  carried  on  a  non-essential, 
centromere-containing,  ~90-kilobase  chromosome  fragment”.  YPH277  cells  that 
retained  the  SUPJJ -containing  chromosome  formed  white  colonies;  the  loss  of  this 
chromosome  resulted  in  a  red  sector  or  sectors.  The  UBRl  gene  was  deleted  in  the  YPH277 
background,  and  the  resulting  ubrlA  strain  YPH277HR1  was  compared  with  YPH277. 
Cells  were  grown  in  the  uracil-lacking  SD  medium  that  retained  the  SUPJl -containing 
chromosome,  followed  by  plating  on  YPD  plates. 
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The  RPN4  (SON1,  UFD5)  protein  of  the  yeast  Saccharomyces  cer- 
evisiae  is  required  for  normal  levels  of  intracellular  proteolysis. 
RPN4  is  a  transcriptional  activator  of  genes  encoding  proteasomal 
subunits.  Here  we  show  that  RPN4  is  required  for  normal  levels  of 
these  subunits.  Further,  we  demonstrate  that  RPN4  is  extremely 
short-lived  {Un  ^2  min),  that  it  directly  interacts  with  RPN2,  a 
subunit  of  the  26S  proteasome,  and  that  rpn4A  cells  are  perturbed 
in  their  cell  cycle.  The  degradation  signal  of  RPN4  was  mapped  to 
its  N-terminal  region,  outside  the  transcription-activation  domains 
of  RPN4.  The  ability  of  RPN4  to  augment  the  synthesis  of  protea¬ 
somal  subunits  while  being  metabolically  unstable  yields  a  nega¬ 
tive  feedback  circuit  in  which  the  same  protein  up-regulates  the 
proteasome  production  and  is  destroyed  by  the  assembled  active 
proteasome. 

proteolysis  |  ubiquitin  |  N-end  rule  |  UFD  pathway  |  cell  cycle 

The  Saccharomyces  cerevisiae  RPN4  gene  (its  earlier  names 
are  SONl  and  UFD 5)  (1)  was  originally  identified  through 
mutant  rpn4  alleles  that  suppressed  the  growth  defect  of  sec63- 
101  cells,  which  bore  a  temperature-sensitive  (ts)  variant  of 
SEC63,  an  essential  component  of  the  protein  translocation 
channel  in  the  endoplasmic  reticulum  membrane  (2).  More 
recent  studies  have  shown  that  mutations  in  RPN4  inhibit  the 
degradation  of  normally  short-lived  proteins  that  are  targeted  by 
the  N-end  rule  pathway,  by  the  Mbiquitin//usion/^/egradation 
(UFD)  pathway,  and  apparently  also  by  other  pathways  of  the 
ubiquitin  (Ub)-proteasome  system  (3,  4).  These  findings  sug¬ 
gested  that  the  ability  of  rpn4  mutations  to  suppress  the  condi¬ 
tional  lethality  of  sec63-101  may  stem  from  stabilization  of  the 
mutant  but  partially  active  SEC63-101  against  degradation  at 
nonper missive  temperature. 

Regulated  proteolysis  by  the  Ub/proteasome  system  plays 
essential  roles  in  the  cell  cycle,  differentiation,  stress  responses, 
and  many  other  processes  (5-7).  Ub  is  a  76-residue  protein  whose 
covalent  conjugation  to  other  proteins  marks  these  proteins  for 
degradation  by  the  26S  proteasome,  an  ATP-dependent  multi¬ 
subunit  protease.  Ub  conjugation  involves  the  formation  of  a 
thioester  between  the  C  terminus  of  Ub  and  a  specific  cysteine 
of  the  Ub-activating  (El)  enzyme.  The  Ub  moiety  of  El~Ub 
thioester  is  transesterified  to  a  cysteine  in  one  of  several  Ub- 
conjugating  (E2)  enzymes.  The  Ub  moiety  of  E2~Ub  thioester 
is  conjugated  via  the  isopeptide  bond  to  the  e-amino  group  of 
either  a  substrate’s  Lys  residue  or  a  Lys  residue  of  another  Ub 
moiety,  the  latter  reaction  resulting  in  a  substrate-linked 
multi-Ub  chain  (7,  8).  Most  E2  enzymes  function  in  complexes 
with  proteins  called  E3  (9-11).  The  functions  of  E3s  include  the 
initial  recognition  of  degradation  signals  (degrons)  in  substrate 
proteins,  with  different  E3s  recognizing  different  classes  of 
degrons  (12-14).  The  E2-E3  complexes,  referred  to  as  Ub  ligases 
(this  term  is  also  used  to  denote  E3s  alone),  mediate  the 
formation  of  substrate-linked  multi-Ub  chains  (15, 16).  Ubiqui- 
tylated  substrates  are  processively  degraded  by  the  26S  protea¬ 
some,  which  consists  of  the  20S  core  proteasome  and  two  19S 
particles  (17-19).  In  vivo,  the  20S  proteasome  exists  in  complexes 


with  either  the  19S  particle  or  the  US  particle  (of  a  distinct 
protein  composition).  The  latter  particle  stimulates  the  pepti¬ 
dase  but  not  the  protease  activity  of  the  20S  proteasome  (18). 
One  19S  and  one  US  particle  can  be  bound  to  each  end  of  the 
same  20S  proteasome,  a  configuration  of  likely  physiological 
significance  (20,  21).  The  19S  particle  mediates  the  binding  and 
ATP-dependent  unfolding  of  a  substrate  protein  before  its 
transfer  to  the  interior  of  the  20S  core  (17).  The  biogenesis  of  the 
20S  proteasome  has  been  analyzed  in  some  detail  (ref.  22  and 
refs,  therein).  Most,  if  not  all,  of  the  genes  encoding  the 
stoichiometrically  present  subunits  of  the  S.  cerevisiae  26S  pro¬ 
teasome  have  been  identified  (1,  23-26),  but  regulation  of  these 
genes  remains  to  be  understood. 

Consistent  with  the  effects  of  rpn4  mutations  on  Ub/ protea- 
some-dependent  proteolysis,  RPN4  was  reported  to  cofraction¬ 
ate  with  a  partially  purified  26S  proteasome  (27).  However, 
RPN4  was  not  detected  among  proteasomal  subunits  in  other 
analyses  of  purified  26S  proteasomes  (23,  25).  Recent  work 
identified  a  specific  sequence  motif  in  the  promoters  of  yeast 
proteasomal  genes  and  demonstrated  that  RPN4  binds  to  this 
motif  and  functions  as  a  transcriptional  activator  of  the  motif- 
containing  promoters  (28). 

In  the  present  study,  we  showed  that  RPN4  is  required  for 
normal  levels  of  proteasomal  subunits  in  the  cell.  Further,  we 
found  that  RPN4  is  an  extremely  short-lived  protein,  that  it 
directly  interacts  with  at  least  one  specific  subunit  of  the  26S 
proteasome,  and  that  cells  lacking  RPN4  are  perturbed  in  their 
progression  through  post-Gi  phases  of  the  cell  cycle.  We  also 
characterized  the  degron  of  RPN4,  locating  it  outside  the 
putative  transcription-activating  domains.  Our  findings  indicate 
that  the  ability  of  RPN4  to  augment  the  synthesis  of  proteasomal 
subunits  while  being  metabolically  unstable  yields  a  negative 
feedback  circuit  in  which  the  intracellular  proteolysis  is  up- 
regulated  by  a  protein  that  is  destroyed  by  the  assembled  active 
proteasome. 

Materials  and  Methods 

Strains,  Plasmids,  and  /3-Galactosidase  (iSgal)  Assay.  The  5.  cerevisiae 
strains  used  were  EJY140  {MATa  trpl-A63  ura3-52  his3~^200 
leu2-3,  112  lys2-801  rpn4^\\LEU2y,  JD52  {MATa  trpl-l:i.63 
ura3-52  his3-^200  leu2-3,  112  lys2-80iy,  AVY302 
{rpn2l!^:\URA3  derivative  of  JD52);  Y791  {MATa  cim5~l  ura3~52 
his3-ts200  leu2My,  MHY501  {MATa  trpl-1  ura3-52  his3-^200 
leu2-3,  112  lys2-80iy  MHY1409  {MATa  trpl-1  ura3-52  his3- 
tslOO  leu2-3, 112  lys2-801  ubal-2)  (3,  29,  37).  RPNl  containing 
its  promoter  region  was  isolated  as  a  suppressor  of  the  toxicity 


Abbreviations:  ts,  temperature  sensitive;  Ub,  ubiquitin;  jSgal,  f.  co//  j3-galactosidase;  UFD, 
u  b//^us  i  o  n/de  g  ra  d  at  i  o  n . 
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of  overexpressed  N>end  rule  pathway  (30).  PRE6  (containing  its 
promoter  region)  and  the  ORF  of  FFN4  were  amplified  by  PCR 
from  total  DNA  of  S.  cerevisiae  YPH500.  All  constructs  were 
verified  by  DNA  sequencing.  RPNl  and  PRE6  (bearing  their 
promoter  regions)  were  subcloned  into  the  low-copy  vector 
pRS313  (31),  yielding  p313RPNl  and  p313PRE6.  The  RPN4 
ORF  was  subcloned  into  the  low-copy  vector  pRS314CUPl 
derived  from  pRS314  (31),  yielding  p314CUPlRPN4,  in  which 
RPN4  was  expressed  from  the  Pcupi  promoter.  For  immuno- 
blotting  and  immunoprecipitation,  the  flag  epitope  was  added  to 
the  C  termini  of  RPNl,  PRE6,  and  RPN4.  Alternatively,  the  N 
terminus  of  RPN4  was  extended  with  the  ha  epitope  (32).  For 
GST-pulldown  assays,  the  ORFs  of  RPNl,  RPN2,  RPN9,  RPNIO, 
and  RPN12  were  fused  in  frame  to  the  3'-end  of  the  GST-coding 
sequence  in  pGEX-4T-3  (Amersham  Pharmacia).  The  C- 
terminally  flag-tagged  derivative  of  RPN4  was  subcloned  into 
pET-llc  (Novagen).  Escherichia  coli  BL21  (DE3)  was  used  to 
express  GST  fusions,  as  well  as  RPN4-f  lag.  The  E.  coli  lacZ  gene 
encoding  jSgal  lacking  the  first  eight  residues  was  amplified  by 
PCR,  by  using  pMC1871  (Amersham  Pharmacia)  as  a  template. 
This  /3gal  was  expressed  as  a  fusion  to  the  C  terminus  of 
RPN4i_i5i  from  the  Pcupi  promoter  and  pRS315  vector  (31). 

UbK48R,G76A 

was  expressed  from  the  Pcupi  promoter  and  a 
high-copy  vector  (33).  Arg-j3gal,  derived  from  Ub-Arg-j3gal,  was 
expressed  from  the  Pgali  promoter  and  a  high-copy  vector  (3). 
The  activity  of  )3gal  in  yeast  extracts,  from  cultures  at  Aeoo  of 
0.8-1.0,  was  determined  as  described  (32). 

Immunoblotting,  Pulse-Chase,  and  GST-Pulldown  Assays.  S,  cerevisiae 
transformants  were  grown  to  ODeoo  of  0.8 -1.0,  harvested,  and 
resuspended  in  lysis  buffer  (1%  Triton  X-100/0.15  M  NaCl/1 
mM  EDTA/ 50  mM  Na-Hepes,  pH  7.5)  containing  1  x  protease 
inhibitor  mix  (Boehringer  Mannheim),  and  lysed  by  vortexing 
with  glass  beads.  Equal  amounts  of  extracts  were  separated  by 
SDS/PAGE,  followed  by  immunoblotting  with  monoclonal  anti¬ 
flag  antibody  (Sigma)  or  anti-ha  antibody  (Covance,  Berkeley, 
CA).  SDS/PAGE  in  6,  8,  and  12%  gels  was  used  with,  respec¬ 
tively,  RPNl -flag,  RPN4-flag  or  ha-RPN4,  and  PRE6-flag  or 
truncated  RPN4  proteins.  Pulse-chase  procedures  were  as 
described  (30).  Briefly,  10-ml  cultures  (ODeoo  of  0.8 -1.2)  of 
wild- type,  rpn2L,  cim5-l,  and  ubal-2  S.  cerevisiae  in  SD  media 
(30)  containing  0.2  mM  CUSO4  were  labeled  for  5  min  with  0.15 
mCi  of  [^^S]methionine/cysteine  (EXPRESS,  New  England 
Nuclear).  Labeled  cells  were  pelleted,  resuspended  in  0.8  ml  of 
lysis  buffer,  and  lysed  as  above.  The  extracts  were  centrifuged  at 
12,000  X  g  for  10  min,  and  supernatants  containing  equal 
amounts  of  CClsCOOH-insoluble  ^^S  were  used  for  immuno¬ 
precipitation  with  anti-flag,  anti-ha,  or  anti-jSgal  antibodies 
(Promega).  For  binding  assays  with  GST  fusions  (GST-pulldown 
assays),  see  ref.  30  and  the  legend  to  Fig.  \E, 

Cell  Cycle  Analysis.  Flow  cytometric  DNA  analysis  was  performed 
as  described  previously  (34).  Briefly,  exponentially  growing  cells 
(OD600  ^1.0)  were  fixed  in  70%  ethanol  and  treated  with  RNase 
A  (2  mg/ml)  at  37°C  for  2  h.  Cells  were  then  stained  with 
propidium  iodide  (50  /xg/ml)  and  analyzed  by  using  Becton 
Dickinson  FACScan.  Cells  {^2  X  10^)  were  analyzed  in  each 
sample.  For  analyses  of  synchronous  cultures,  cells  were  grown 
to  ODeoo  of  «=^1.0  and  treated  with  a  factor  (5  fxg/ml)  for  2  h, 
then  washed  and  resuspended  in  yeast/peptone/dextrose  (YPD 
medium).  Light  microscopic  determination  of  the  fraction  of 
budded  cells  was  carried  out  with  samples  taken  after  2-h  Gi 
arrest  and  at  20-min  intervals  after  release  from  arrest,  with 
^200  cells  analyzed  from  each  sample.  Synchronized  cultures 
were  also  characterized  by  flow  cytometry. 
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Fig.  1.  RPN4  Is  required  for  normal  expression  of  proteasomal  subunits.  {A 
and  B)  Immunoblot  analysis  of  RPNl  {A)  and  PRE6  (fi)  that  were  C-terminally 
tagged  with  flag  epitope  and  expressed  from  their  native  promoters  and  a 
low-copy  plasmid  in  a  rpn4A  S.  cerevisiae  strain  (lane  2)  and  its  congenic 
wild-type  counterpart  JD52  (lane  3).  Lane  1  in  A  and  B,  cells  transformed  with 
empty  vector.  (Q  Enhanced  expression  of  RPN4  marginally  elevates  the  level 
of  RPNl.  rpn4A  cells  were  cotransformed  with  low-copy  plasmids  expressing, 
respectively,  RPNl -flag  and  RPN4  from  the  Pcupi  promoter.  Increasing  con¬ 
centrations  of  CUSO4  were  used  to  induce  the  expression  of  RPN4  (lanes  3-7). 
Lanes  1  and  2  in  C,rpn4A  cells  transformed,  respectively,  with  empty  RPNl -flag 
vector  and  empty  RPN4  vector.  (D)  Immunoblot  analyses  of  C-terminally 
(RPN4-flag)  or  N-terminally  (ha-RPN4)  tagged  RPN4  that  was  expressed  from 
the  induced  Pcupi  promoter  and  low-copy  plasmid  either  in  wild-type  (RPN4) 
strain  JD52  or  in  AVY302,  a  congenic  rpn2A  mutant  (see  legend  to  Fig.  2  for 
details),  (f)  RPN4  interacts  with  RPN2  in  GST-pulldown  assays.  Extractsof  £.  co// 
expressing  RPN4-flag  were  incubated  with  glutathione-agarose  beads  pre- 
loaded  with  the  indicated  GST  fusions  or  GST  alone.  The  retained  proteins 
were  eluted,  fractionated,  and  immunoblotted  with  anti-FLAG  antibody. 
Approximately  equal  amounts  of  different  GST  fusions  were  immobilized  on 
glutathione-agarose  beads  in  these  assays,  as  verified  by  Coomassie  staining 
(data  not  shown).  SDS/PAGE  in  6,  8,  and  12%  gels  was  used,  respectively,  in 
A  and  C,  inDandE,  and  in  6.  The  asterisk  in  A,  C,  and  D  indicates  a  crossreacting 
band. 


Results 

RPN4  Is  Required  for  Normal  Expression  of  Proteasome  Components. 

We  initially  attempted  to  identify  S.  cerevisiae  RPN4-binding 
proteins  by  using  the  yeast  two-hybrid  assay  and  observed  that 
RPN4  functioned  as  a  transcriptional  activator  when  fused  to  the 
DNA-binding  domain  of  GAL4  (data  not  shown).  This  result 
was  consistent  with  the  finding  that  RPN4  binds  to  a  DNA 
sequence  motif  present  in  the  promoters  of  most  genes  encoding 
proteasomal  subunits  and  several  other  genes  of  the  Ub/pro- 
teasome  system  (28).  We  then  examined  the  levels  of  two  26S 
proteasomal  subunits,  RPNl  (of  the  19S  particle)  and  PRE6  (of 
the  20s  core  proteasome)  in  the  presence  and  absence  of  RPN4, 
by  using  immunoblotting  (Fig.  1^1  and  B).  C-terminally  epitope- 
tagged  RPNl -flag  and  PRE6-flag  were  expressed  from  their 
own  promoters  on  a  low-copy  vector.  The  levels  of  RPNl -flag 
and  PRE6-f  lag  were  significantly  lower  in  a  rpn4!A  strain  than  in 
a  congenic  wild-type  strain  (Fig.  \A  andB,  lanes  2  and  3).  Taken 
together  with  the  gene  expression  data  (28),  these  results 
indicated  that  RPN4  is  a  positive  transcriptional  regulator  of 
genes  encoding  proteasomal  subunits. 

RPN4  Is  a  Short-Lived  Protein  Degraded  by  the  26S  Proteasome. 

RPN4  was  expressed  from  the  copper-inducible  Pcupi  promoter 
on  a  low-copy  vector  in  the  rpn4is  strain  that  also  expressed 
RPNl -flag  from  its  natural  promoter.  The  expression  of  RPN4 
from  uninduced  Pcupi  was  sufficient  to  greatly  augment  the 
expression  of  RPNl-flag,  but  further  enhancement  of  RPN4 
expression  elevated  the  level  of  RPNl-flag  only  slightly  (Fig.  1C, 
lanes  3-7).  In  agreement  with  this  result,  a  strongly  increased 
expression  of  RPN4  in  the  wild-type  {RPN4)  background  re¬ 
sulted  in  at  most  a  slight  enhancement  of  proteasome  activity,  as 
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Fig.  2.  RPN4  is  a  short-lived  protein  degraded  by  a  proteasome-dependent 
pathway.  {A-Q  Pulse-chase  analysis  of  C-terminally  tagged  RPN4  (RPN4-flag) 
that  was  expressed  from  the  induced  Pcupi  promoter  and  low-copy  plasmid 
either  In  wild-type  {RPN4)  strain  JD52  (4),  in  a  congenic  rpn2^  mutant  (B),  or 
In  a  cim5-1  mutant  (Q.  Cells  were  labeled  at  28°C  and  chased  at  28*^0  in  A  and 
B.  and  at  37"C  In  C  The  kinetics  of  RPN4  degradation  in  JD52  were  similar  at 
28  and  37°C  (data  not  shown).  Arrowheads  Indicate  the  band  of  RPN4-f  lag.  (D) 
Quantitation  of  the  patterns  in  A-C,  by  using  Phosphorlmager  (Molecular 
Dynamics)  •,  wild-type  cells.  O,  rpn2A  cells.  A,  dm 5-1  cells.  (E)  Pulse- chase 
analyses  of  Arg-/3gal  (R-/3gal),  derived  from  Ub-Arg-^gal  (3, 14),  and  of  RPN4- 
flag  (expressed  as  in  A)  in  uba  1-2  (37)  and  wild-type  cells.  Quantitation:  •and 
O,  Arg-/3gal  in  wild-type  and  uba1-2  cells,  respectively.  A  and  A,  RPN4-flag  in 
the  same  strains,  (f)  The  same  test  proteins  in  Wild-type  cells  and  cells  over- 
expressing  with  quantitation  on  the  right;  same  designations.  The 

rpn2A  locus,  in  the  strain  AVY302  (see  Materials  and  Methods),  was  a  disrup¬ 
tion  allele.  It  was  produced  through  the  integration  of  URA3  at  the  H/ndlll  site 
of  RPN2  (codon  146)  and  was  identical  to  the  rpn2\\URA3  allele  described  by 
Yokota  et  al.  (36).  The  phenotypes  of  AVY302  and  the  previously  described 
strain  (36)  were  similar  as  well  (data  not  shown).  Two  other  studies  reported 
that  rpn2A  cells  were  inviable  (4,  56).  The  disruption  of  RPN2  in  these  works 
was  carried  out  with  either  TRP1  or  ADE2,  by  using  the  RPN2  BglW  site  at  codon 
38.  It  remains  to  be  determined  whether  different  Integration  sites  account 
for  different  phenotypes  described  for  rpn2A  strains. 


assessed  by  measuring  the  steady-state  levels  of  test  substrates 
that  are  targeted  by  either  the  N-end  rule  pathway  or  the  UFD 
pathway  (data  not  shown),  both  pathways  being  Ub/protea- 
some-dependent  (3,  35).  By  contrast,  expression  of  RFN4  in 
rpn4A  cells  greatly  increased  the  activity  of  these  proteolytic 
pathways  (data  not  shown). 

Given  these  results,  we  expressed  either  a  C-terminally  tagged 
RPN4-flag  or  an  N-terminally  tagged  ha-RPN4  from  the  Pcupi 
promoter  and  determined  the  levels  of  RPN4  protein  at  different 
concentrations  of  the  promoter-inducing  (3uS04.  Neither  of 
these  tagged  derivatives  of  RPN4  could  be  detected  by  immu- 
noblotting  even  on  the  induction  of  Pcupi  (Fig-  suggesting 
that  RPN4  was  a  short-lived  protein,  a  property  that  coiild 
account  for  the  relative  insensitivity  of  RPN4-dependent  pro¬ 
moters  to  the  level  of  transcriptional  activity  of  an  RPN4- 
expressing  gene.  Pulse- chase  assays  were  then  carried  out  with 
RPN4-f  lag  in  wild-type  (RPN4)  cells,  revealing  that  the  in  vivo 
half-life  of  RPN4-f  lag  was  ^2  min  (Fig.  2  A  and  D),  The  half-life 
of  N-terminally  tagged  ha-RPN4  was  indistinguishable  from  that 
of  C-terminally  tagged  RPN4-flag  (data  not  shown),  ruling  out 
the  epitope  tags  as  the  cause  of  RPN4  metabolic  instability.  S. 
cerevisiae  cells  lacking  RPN2,  a  protein  of  the  19S  component  of 
the  26S  proteasome,  are  partially  defective  in  the  degradation  of 
natural  substrates  of  the  proteasome  (36)  (see  also  the  legend  to 
Fig.  2).  Both  RPN4-flag  and  ha-RPN4  could  be  detected  by 
immunoblotting  in  rpnllA  cells,  in  contrast  to  wild-type  {RPN2) 
cells  (Fig.  ID).  Consistent  with  these  results,  pulse-chase  assays 
indicated  a  significant  decrease  in  degradation  of  RPN4-f lag  in 
r/?n2A  cells,  in  comparison  to  wild-type  cells  (Fig.  2B  andD).  In 


addition,  RPN4-flag  was  found  to  be  long-lived  in  the  cim5-l 
mutant  (Fig.  2  C  and  D\  which  bears  a  ts  mutation  in  RPTl,  an 
essential  ATPase  of  the  19S  particle  (29).  We  conclude  that  the 
in  vivo  degradation  of  RPN4  is  mediated  by  the  26S  proteasome. 

RPN4  Interacts  with  RPN2,  a  Component  of  the  19S  Particle.  It  was 

reported  that  RPN4  cofractionated  with  the  26S  proteasome 
(27),  but  RPN4  was  not  observed  in  other  studies  of  purified  26S 
proteasomes  (23,  26).  Previous  work  (30)  used  the  GST- 
pulldown  assay  to  demonstrate  that  UBRl  and  UFD4,  the  Ub 
ligase  (E3)  components  of,  respectively,  the  N-end  rule  and  UFD 
pathways,  directly  interact  with  specific  subunits  of  the  protea- 
some’s  19S  particle.  We  used  the  GST-pulldown  approach  to 
determine  whether  RPN4  might  interact  with  specific  subunits  of 
the  19S  particle.  In  these  experiments,  several  proteins  of  the  19S 
particle  were  expressed  in  E.  coli  as  fusions  to  the  C  terminus  of 
GST.  Extracts  fromE.  coli  expressing  RPN4-f  lag  were  incubated 
with  glutathione-agarose  beads  preloaded  with  GST-RPNl, 
GST-RPN2,  GST-RPN9,  GST-RPNIO,  GST-RPN12,  or  GST 
alone.  The  bound  proteins  were  eluted,  fractionated  by 
SDS/PAGE,  and  immunoblotted  with  anti-flag  antibody. 
RPN4-f  lag  reproducibly  bound  to  GST-RPN2  but  not  to  any  of 
the  other  tested  subunits  of  the  19S  particle  (Fig.  IE).  It  remains 
to  be  determined  whether  RPN2  is  the  only  proteasomal  ligand 
of  RPN4,  or  whether  some  other  proteasomal  subunits,  among 
the  still  untested  ones,  also  interact  with  RPN4. 

Because  the  steady-state  level  of  RPN4  in  wild-type  cells  was 
sufficiently  low  to  be  undetectable  by  standard  immunoblotting 
(Fig.  ID),  RPN4  cannot  be  a  stoichiometric  component  of  the 
19S  particle.  Furthermore,  because  RPN4  is  also  a  transcrip¬ 
tional  regulator  (28),  it  is  clear  that  the  population  of  RPN4 
molecules  in  a  cell  is  dynamically  partitioned  among  several 
classes  of  physiologically  relevant  complexes  that  include  the  26S 
proteasome  and  either  specific  or  nonspecific  RPN4-binding 
sites  on  the  chromosomes. 

Is  Degradation  of  RPN4  Ubiquitin-Dependent?  As  demonstrated 
above  (Figs.  ID  and  2),  RPN4  is  degraded  by  the  26S  protea¬ 
some.  Several  lines  of  evidence  suggested  that  ubiquitylation 
plays  at  most  a  minor  role  in  the  proteasome-dependent  degra¬ 
dation  of  RPN4.  First,  the  ^2-min  half-life  of  RPN4  in  wild-type 
cells  (Fig.  2D)  was  not  significantly  changed  in  the  ubal~2 
mutant,  which  underexpresses  the  Ub-activating  (El)  enzyme 
and  is  therefore  strongly  impaired  in  ubiquitylation  of  proteins 
(37).  Specifically,  whereas  the  degradation  of  Arg-/3gal,  a  sub¬ 
strate  of  the  Ub-dependent  N-end  rule  pathway  (14,  38),  was 
decreased  in  the  ubal-2  mutant,  no  significant  change  was 
observed  with  RPN4-flag  in  this  mutant  (Fig.  2E).  Second, 
overexpression  of  a  Ub  mutant  that  inhibits  the 

formation  of  Lys'^^-linked  multi-Ub  chains,  which  are  essential 
for  a  large  fraction  of  the  proteasome-dependent  proteolysis  (33, 
39),  did  not  significantly  decrease  the  rate  of  RPN4  degradation 
but  did  decrease  the  degradation  of  Arg-j8gal  (Fig.  2E).  Third, 
the  kinetics  of  degradation  of  RPN4-f  lag  in  wild-type  cells  were 
indistinguishable  from  that  in  mutants  lacking  one  of  the  fol¬ 
lowing  Ub-conjugating  (E2)  enzymes:  UBCl,  RAD6  (UBC2), 
CDC34  (UBC3),  UBC4,  UBC5,  UBC6,  UBC7,  or  UBC8  (data 
not  shown).  Analogous  pulse-chase  assays  were  also  carried  out 
with  mutants  lacking  different  pairs  of  these  E2  enzymes.  The 
degradation  of  RPN4  was  slightly  decreased  in  [mZ)c4A  ubc5ts] 
cells,  which  lacked  UBC4  and  UBC5,  two  highly  similar  E2s 
(data  not  shown).  Because  UBC4/UBC5  are  a  functionally 
major  class  of  E2  enzymes  in  S.  cerevisiae  (7),  and  because  [ubc4K 
mZ?c5A]  cells  grow  slowly  and  exhibit  a  number  of  defects,  the 
observed  marginal  stabilization  of  RPN4  in  a  [ubc4iS.  ubc5ti\ 
strain  could  be  an  indirect  result  of  multiple  changes  that  are 
caused  by  the  absence  of  UBC4/UBC5.  Fourth,  the  degradation 
of  RPN4  was  not  decreased  in  5.  cerevisiae  mutants  that  lacked 
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Fig.  3.  The  N-terminal  region  of  RPN4  contains  a  portable  degradation 
signal.  (4)  Diagrams  of  full-length  and  truncated  RPN4.  Two  putative  tran¬ 
scription  activation  domains  are  at  positions  211-229  and  300-315  (hatched 
boxes).  A  putative  bipartite  nuclear  localization  signal  is  at  position  381-399 
(black  box).  A  putative  C2H2  finger  (residues  477-507)  Is  also  Indicated.  (B) 
N-terminally  truncated  RPN4  derivatives  are  long-lived.  RPN4  and  its  trun¬ 
cated  derivatives  were  expressed  from  the  Pcupi  promoter  and  low-copy 
vector  in  the  JD52  {RPN4)  strain.  Relative  steady-state  levels  of  truncated  RPN4 
proteins  (all  of  them  tagged  C-terminally  with  flag  epitope)  were  determined 
by  immunoblotting,  by  using  SDS/12%  PAGE  and  anti-flag  antibody.  The 
bands  of  proteins  with  expected  sizes  are  indicated  by  arrowheads.  (O 
RPN4i-i5rflag  Is  short-lived.  Pulse-chase  assays,  by  using  SDS/12%  PAGE, 
were  carried  out  in  wild-type  {RPN4)  cells  with  RPN4206-53rflag  and 
RPN4i_i5i-flag  (indicated  by  arrowheads)  essentially  as  described  in  Fig.  2, 
except  that  they  were  performed  at  30®C  instead  of  28°C.  (D)  The  1 5 1  -residue 
N-terminal  fragment  of  RPN4  contains  a  portable  degron.  Pulse-chase  assay 
with  RPN4i_i5i-/3gal  fusion  (see  Materials  and  Methods)  expressed  from  the 
Pcupi  promoter  and  low-copy  vector  in  the  JD52  {RPN4)  strain,  by  using 
SDS/6%  PAGE  and  immunoprecipitation  with  anti-)3gal  antibody. 

one  of  the  following  E3  enzymes:  UBRl,  UFD4,  RSP5,  TOMl, 
HUL4,  or  HUL5  (data  not  shown)  (3,  14,  40,  41).  Thus,  it 
appears,  but  remains  to  be  established  definitively,  that  the 
proteasome-dependent  degradation  of  RPN4  (Fig.  2  A  and 
D)  is  either  largely  or  completely  independent  of  RPN4 
ubiquitylation. 

N-Terminal  Region  of  RPN4  Contains  a  Portable  Degron.  The  degra¬ 
dation  signals  of  transcriptional  activators  are  often  located 
within  their  activation  domains  (42-45).  We  constructed  a  set  of 
truncated,  C-terminally  tagged  RPN4  derivatives  that  contained 
either  two,  one,  or  none  of  the  putative  transcription-activation 
domains  of  the  intact  RPN4  (Fig.  3.^4).  Whereas  wild-type  RPN4 
was  undetectable  by  immunoblotting,  because  of  its  rapid  in  vivo 
degradation  (Figs.  ID  and  2  A  and  D),  RPN4  derivatives  such  as 
RPN4206-531?  RPN43 14^531,  and  RPN441 7^531,  which  lacked  at 
least  the  first  205  residues  of  RPN4,  were  readily  detectable  (Fig. 
3B,  lanes  5-8).  By  contrast,  similarly  expressed  truncated  deriv¬ 
atives  that  retained  this  N-terminal  region  of  RPN4  (RPN4i_4i4, 
RPN4i_315,  RPN4i_229j  and  RPN4i_i5i)  were  still  largely  unde¬ 
tectable  by  immunoblotting  (Fig.  3B,  lanes  1-4),  consistent  with 
the  interpretation  that  the  major  degron  of  RPN4  was  located 
within  its  first  150-200  residues.  This  region  does  not  contain 
putative  transcription- activation  domains  (Fig.  3/4). 

Pulse-chase  assays  with  RPN4i_i5i-flag  (containing  the  pre¬ 
sumed  degron  of  RPN4)  and  RPN4206 -531 -flag  (lacking  this 
degron)  confirmed  the  inferences  from  immunoblotting  data: 


RPN4i  -151-flag  was  rapidly  degraded  (ti/2  ^2  min),  whereas 
RPN4206-53i-f  lag  was  long-lived  (Fig.  3C).  The  degron  of  RPN4 
is  portable,  in  that  ligation  of  RPN4i_i5i  to  the  otherwise 
long-lived  115-kDa  jSgal  moiety  resulted  in  a  metabolically 
unstable  RPN4i_i5i-j3gal  protein  (tyz  ^5  min)  (Fig.  3D).  No 
sequence  similarities  between  the  151-residue  region  of  RPN4 
and  the  known  motifs  that  act  as  degrons  in  other  short-lived 
proteins  (13)  could  be  detected,  suggesting  that  this  region  of 
RPN4  contains  a  degradation  signal  which,  to  our  knowledge,  is 
novel.  Interestingly,  the  C-terminally  truncated,  degron- 
containing,  metabolically  unstable  RPN4i_229  did  not  bind  to 
RPN2  of  the  19S  particle  in  the  GST-pulldown  assay  (data  not 
shown),  in  contrast  to  full-length  RPN4  (Fig.  IE),  suggesting  that 
the  in  vivo  degradation  of  intact  RPN4  (ti/2  ^2  min)  may  be 
independent  of  the  demonstrated  RPN4-RPN2  interaction. 

Cells  Lacking  RPI\I4  Exhibit  Delay  in  Post-Gi  Phases  of  the  Cell  Cycle. 

Despite  the  decreased  expression  of  proteasomal  components  in 
cells  (Fig.  1),  they  were  not  only  viable  but  also  similar  to 
congenic  wild-type  cells  in  their  resistance  to  ^260-nm  U  V  light 
and  in  their  ability  to  grow  at  high  temperature  (37'’C),  on  poor 
nitrogen  sources  (proline  and  citrulline),  and  in  the  presence  of 
canavanine  (a  toxic  arginine  analog,  at  1.5  jmg/ml),  NaCl  (at  1 
M),  CUSO4  (at  0.5  mM),  or  ethanol  (at  3%)  (data  not  shown). 
At  the  same  time,  rpn4A.  cells  are  known  to  grow  more  slowly  at 
30°C  on  standard  media  than  wild-type  cells  [(2-4);  unpublished 
data].  To  assess  the  effect  of  RPN4  absence  on  cell  cycle 
progression,  we  carried  out  flow  cytometric  analyses  of  DNA 
content  with  unsynchronized  exponentially  growing  cultures  of 
haploid  rpn4\  and  congenic  wild-type  cells.  The  fraction  of  cells 
with  2N  (replicated)  DNA  content  was  significantly  higher  in 
rpn4A.  culture  (Fig.  4/4).  Light  microscopic  examination  indi¬ 
cated  a  significantly  higher  fraction  of  budded  (post-Gi)  cells  in 
exponentially  growing  rpn4b^  cultures,  in  comparison  to  wild- 
type  ones  (data  not  shown),  suggesting  that  cells  lacking  RPN4 
are  delayed  in  their  progression  through  one  or  more  of  post-Gi 
phases  of  the  cell  cycle. 

To  address  this  question  in  greater  detail,  rpn4ts  and  congenic 
wild-type  cultures  were  synchronized  in  Gi  phase  with  a  factor, 
followed  by  flow  cytometric  analysis  of  cellular  DNA  content  as 
a  function  of  time  after  release  from  Gj  arrest.  As  shown  in  Fig. 
AB,  rpn4b.  cells  exited  Gi  phase  without  delay  relative  to 
wild-type  cells.  However,  it  took  longer  for  rpn4L  cells  to 
progress  through  the  rest  of  the  cell  cycle  (Fig.  AB).  Light 
microscopy  was  used  to  determine  the  fraction  of  budded 
(post-Gj)  cells  in  synchronized  cultures  at  different  times  after 
resumption  of  growth.  Consistent  with  the  results  of  flow 
cytometric  analysis,  the  budding  of  rpn4^  and  wild-type  cells 
occurred  at  approximately  the  same  time  (peak  at  ^80  min)  (Fig. 
4C).  However,  it  took  longer,  on  average,  for  the  budded  rpn4^ 
cells  to  complete  cell  division  («^140  min  for  rpnAHs  cells  versus 
^120  min  for  wild-type  cells).  Note  also  that  a  smaller  fraction 
of  budded  cells  exited  from  M  to  Gi  in  the  rpn4ii  culture  than 
in  the  wild-type  culture  (Fig.  4C),  consistent  with  the  data  for 
unsynchronized  cultures  (Fig.  AA).  Because  the  ratio  of  small-  to 
large-budded  cells  was  essentially  the  same  for  the  synchronized 
wild-type  and  cultures  at  different  time  points  (data  not 

shown),  it  is  likely  that  the  absence  of  RPN4  affects  more  than 
one  post-Gi  step,  e.g.,  both  the  S  ^  G2  and  G2  M  transitions. 

Discussion 

RPN4  was  originally  identified  as  an  extragenic  suppressor  of 
sec63-’101,  a  ts  allele  of  SEC63,  which  is  required  for  the 
translocation  of  proteins  into  the  endoplasmic  reticulum  and  the 
transport  of  proteins  to  the  nucleus  (2).  rpn4  mutations  sup¬ 
pressed  the  ts  growth  defect  oisec63-101  but  did  not  reverse  its 
phenotype  of  mislocalization  of  proteins  that  bore  a  nuclear 
targeting  signal.  Moreover,  rpn4  mutants  themselves  had  a 
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Fig.  4.  Cell  cycle  progression  Is  delayed  after  Gi  phase  in  rpn4A  mutant.  (>4) 
A  larger  fraction  of  rpn4A  cells  have  2N  DNA  content  in  unsynchronized 
cultures.  DNA  flow  cytometry  was  carried  out  with  wild-type  {RPN4)  and 
rpn4A  cells.  The  abscissa  ordinate  indicate,  respectively,  the  number  of  cells 
and  relative  DNA  content.  (B  and  C)  Cell  cycle  progression  of  rpn4A  cells  is 
delayed  after  Gi  phase.  Wild-type  and  congenicrpn4A  cells  were  synchronized 
in  lateGi  by  using  afactor.  Initial  samples  were  taken  after  2  h  of  Gi  arrest,  and 
subsequent  samples  were  taken  at  20-mln  intervals  after  release  from  the 
arrest.  The  samples  were  analyzed  by  using  DNA  flow  cytometry  (B)  and  by 
determining  the  fraction  of  budded  cells  (O-  Data  shown  in  C  are  the  means 
of  triplicate  measurements. 


defect  in  the  nuclear  protein  import  (2).  Later  work  has  shown 
that  mutations  in  RPN4  inhibit  the  activity  of  the  Ub/protea- 
some-dependent  N-end  rule  and  UFD  proteolytic  pathways  (3). 
Our  findings  (Fig.  1  A  and  B)  and  a  recent  study  (28)  demon¬ 
strated  that  RPN4  is  required  for  normal  expression  of  subunits 
of  the  26S  proteasome  and  several  other  components  of  the  Ub 
system.  A  parsimonious  interpretation  is  that  the  observed 
down-regulation  of  the  Ub/proteasome-dependent  proteolysis 
in  rpn4A.  cells  results  primarily  from  decreased  expression  of 
proteasomal  subunits  and  functionally  related  proteins. 

Previous  studies  did  not  reach  a  consensus  on  whether  RPN4 
is  a  component  of  the  26S  proteasome  (23,  24,  26,  27).  A  direct 
and  specific  interaction  between  RPN2  (a  stoichiometric  subunit 
of  the  19S  particle)  and  RPN4  was  demonstrated  in  the  present 
work  (Fig.  IE).  This  result,  together  with  the  earlier  finding  that 
the  E3  enzymes  UBRl  and  UFD4  interact  with  specific  subunits 
of  the  19S  particle  (30)  and  the  evidence  for  interactions  between 
other  E2/E3  enzymes  and  the  26S  proteasome  (23, 46),  indicates 
that  proteasome  is  at  least  a  transient  ligand  of  many  cellular 
proteins. 

An  operationally  useful  definition  of  a  bona  fide  subunit  of  the 
26S  proteasome  should  stipulate  that  a  large  (predetermined) 
fraction  of  the  proteasome  particles  in  a  cell  is  associated  with 
this  subunit  in  vivo.  By  contrast,  and  similarly  to  terminology  in 
the  ribosome  field,  specific  protein  ligands  of  the  proteasome 
that  interact  with  it  transiently  and/or  are  bound  to  small, 
dynamically  determined  subsets  of  the  26S  proteasome,  can  be 
called  proteasome-interacting  proteins.  In  this  terminology, 
RPN4,  UBRl,  and  UFD4  are  proteasome-interacting  proteins, 
as  distinguished  from  the  26S  proteasomal  subunits  such  as 
PRE6,  RPT6,  or  RPN2. 


RPN4  is  not  essential  for  cell  viability  under  normal  condi¬ 
tions.  As  described  above,  cells  are  not  hypersensitive  to 
a  variety  of  physical  and  chemical  stresses,  suggesting  that  a 
strongly  decreased  concentration  of  the  26S  proteasome  in 
rpn4A  cells  is  sufficient  to  maintain  cell  viability  and  growth  even 
under  conditions  of  stress.  At  the  same  time,  given  the  dimin¬ 
ished  activity  of  at  least  the  N-end  rule  and  UFD  pathways  in 
rpn4A  cells  (3),  one  would  expect  an  impairment  of  some 
physiological  functions  that  require  these  pathways.  The  post-Gi 
abnormality  in  the  cell  cycle  progression  of  rpn4 A  cells  described 
in  the  present  work  (Fig.  4)  is  unlikely  to  be  caused  by  inhibition 
of  the  N-end  rule  pathway,  because  ubrlA  cells,  which  lack  this 
pathway,  lose  chromosomes  at  a  greatly  increased  frequency  but 
are  similar  to  congenic  wild-type  cells  in  the  kinetics  of  cell  cycle 
progression  (47).  [The  chromosome-loss  phenotype  of  ubrlA 
cells  is  caused  largely  by  metabolic  stabilization  of  the  ESPl- 
produced  fragment  of  SCCl,  a  prosubstrate  of  the  N-end  rule 
pathway  and  a  component  of  the  chromosome-bound  cohesin 
complex  (47).]  Because  RPN4  functions  as  a  transcriptional 
activator  of  many  nonproteasomal  genes  as  well  (48),  the  cell 
cycle  defect  of  rpn4A.  cells  may  stem  from  down-regulation  of 
these  other  genes.  For  example,  the  expression  of  CDC48,  which 
is  essential  for  the  cell  cycle  progression  and  encodes  an  AAA- 
type  ATPase,  is  decreased  in  rpn4A  cells  (28).  However,  we 
found  that  overexpression  of  CDC48  from  a  heterologous 
(Pmet25)  promoter  did  not  rescue  the  cell  cycle  abnormality  of 
rpn4A  cells  (data  not  shown). 

One  of  our  main  results  is  the  striking  metabolic  instability  of 
RPN4  {tii2  ^2  min).  The  degradation  of  RPN4  is  proteasome- 
dependent  (Fig.  2  A-D).  At  the  same  time,  several  lines  of 
evidence  (Fig.  2  E  and  F;  see  also  above)  suggest  that  degrada¬ 
tion  of  RPN4  is  largely  independent  of  ubiquitylation.  Proteins 
whose  in  vivo  degradation  is  proteasome-dependent  but  Ub- 
independent  include  ornithine  decarboxylase  (49)  and  the  cy- 
clin-dependent  kinase  inhibitor  p21^'P^  (50,  51). 

The  mapping  of  RPN4  degron  localized  it  to  the  first  150 
residues  of  the  531-residue  RPN4  (Fig.  3),  outside  of  its  putative 
transcription  activation  domains.  This  location  of  a  degron  is  an 
exception  to  the  previously  established  pattern  in  which  the 
activation  domains  and  degrons  tend  to  overlap  in  a  transcrip¬ 
tional  regulator  (42-45).  As  to  the  mechanics  of  RPN4  degron, 
the  properties  of  RPN4  suggest  two  possibilities.  The  N-terminal 
degradation  signal  of  RPN4  may  function  as  a  canonical  bipartite 
degron  of  the  Ub/proteasome  system  (12,  13,  52).  The  first 
determinant  of  such  a  degron  is  bound  by  a  degron-specific 
E2-E3  Ub  ligase,  whereas  the  second  determinant  is  an  internal 
Lys  residue  of  a  substrate  protein.  Alternatively,  RPN4  might  be 
targeted  for  degradation  directly  through  its  demonstrated  in¬ 
teraction  with  RPN2,  a  subunit  of  the  19S  particle  (Fig.  IF).  This 
model  is  made  unlikely  (but  not  definitively  precluded)  by  the 
fact  that  the  degron-containing,  short-lived  RPN4i_229  fragment 
was  unable  to  bind  to  RPN2,  in  contrast  to  full-length  RPN4  (see 
above). 

Metabolic  instability  of  RPN4  identifies  it  as  a  member  of  a 
growing  class  of  components  of  the  Ub/proteasome  system  that 
are  also  substrates  of  this  system  and  are  destroyed  by  it,  either 
constitutively  or  conditionally.  One  example  is  UMPl,  a  chap¬ 
erone  that  is  required  for  efficient  assembly  of  the  20S  protea¬ 
some  and  becomes  its  first  substrate  on  the  completion  of 
assembly  (22,  53).  Other  examples  are  S.  cerevisiae  F-box  pro¬ 
teins  CDC4,  GRRl,  and  MET30,  which  are  short-lived  sub¬ 
strate-recognition  subunits  of,  respectively,  SCF^^'^^ 

and  SCF’^^t^o  ub  ligases  (54,  55). 

The  unique  ability  of  RPN4  to  augment  the  transcription  of 
genes  encoding  proteasomal  subunits  while  being  metabolically 
unstable  yields  a  negative  feedback  circuit  in  which  the  same 
protein  up-regulates  the  proteasome  production  and  is  destroyed 
by  the  assembled  active  proteasome. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-temiinal  residue.  N-terminal 
asparagine  and  glutamine  are  tertiary  destabilizing  residues,  in  that  they  are  enzymatically  deamidated  to  yield 
secondary  destabilizing  residues  aspartate  and  glutamate,  which  are  conjugated  to  arginine,  a  primary 
destabilizing  residue.  N-terminal  arginine  of  a  substrate  protein  is  bound  by  the  C/iri -encoded  E3a,  the  E3 
component  of  the  ubiquitin-proteasome-dependent  N-end  rule  pathway.  We  describe  the  construction  and 
analysis  of  mouse  strains  lacking  the  asparagine-specific  N-terminal  amidase  (Nt^-amidase),  encoded  by  the 
Ntanl  gene.  In  vdld-type  embryos, /VSfanJ  was  strongly  expressed  in  the  branchial  arches  and  in  the  tail  and  limb 
buds.  The  Ntanl~^~  mouse  strains  lacked  the  Nt^-amidase  activity  but  retained  glutamine-specific  Nt^- 
amidase,  indicating  that  the  two  enzymes  are  encoded  by  different  genes.  Among  the  normally  short-lived  N-end 
rule  substrates,  only  those  bearing  N-terminal  asparagine  became  long-lived  in  Ntanl~^~  fibroblasts.  The 
Ntanl~^~  mice  were  fertile  and  outwardly  normal  but  differed  from  their  congenic  wild-type  counterparts  in 
spontaneous  activity,  spatial  memory,  and  a  socially  conditioned  exploratory  phenotype  that  has  not  been 
previously  described  with  other  mouse  strains. 


A  multitude  of  regulatory  circuits  involve  conditionally  or 
constitutively  short-lived  proteins  (26,  27,  44,  48,  49,  64).  Fea¬ 
tures  of  proteins  that  confer  metabolic  instability  are  called 
degradation  signals,  or  degrons  (37,  63).  The  essential  compo¬ 
nent  of  one  degradation  signal,  termed  the  N-degron,  is  a 
destabilizing  N-terminal  residue  of  a  protein  (3).  A  set  of 
N-degrons  containing  different  N-terminal  residues  which  are 
destabilizing  in  a  given  cell  yields  a  rule,  termed  the  N-end 
rule,  which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue.  An  N-end  rule  pathway  is 
present  in  all  organisms  examined,  from  mammals  and  plants 
to  fungi  and  prokaryotes  (63). 

In  eukaryotes,  an  N-degron  comprises  two  determinants:  a 
destabilizing  N-terminal  residue  and  an  internal  lysine  of  a 
substrate  protein  (4,  32,  60).  The  Lys  residue  is  the  site  of 
formation  of  a  substrate-linked  multiubiquitin  chain  (15,  49). 
The  N-end  rule  pathway  is  thus  one  pathway  of  the  ubiquitin 
(Ub)  system  (25-27).  Ub  is  a  76-residue  eukaryotic  protein 
that  exists  in  cells  either  free  or  covalently  conjugated  to  many 
other  proteins.  The  Ub  system  plays  a  role  in  a  vast  range  of 
processes,  including  cell  growth,  division,  differentiation,  and 
responses  to  stress.  In  most  of  these  processes,  Ub  acts  through 
routes  that  involve  the  degradation  of  Ub-protein  conjugates 
by  the  26S  proteasome,  an  ATP-dependent  multisubunit  pro¬ 
tease  (10,  17,  20,  51). 
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(Throughout  the  text,  the  names  of  mouse  genes  are  in 
italics,  with  the  first  letter  uppercase.  The  names  of  human  and 
Saccharomyces  cerevisiae  genes  are  also  in  italics,  all  uppercase. 
If  human  and  mouse  genes  are  named  in  the  same  sentence, 
the  mouse  gene  notation  is  used.  The  names  of  S.  cerevisiae 
proteins  are  roman,  with  the  first  letter  uppercase  and  an  extra 
lowercase  “p”  at  the  end.  The  names  of  mouse  and  human 
proteins  are  the  same,  except  that  all  letters  but  the  last  “p”  are 
uppercase.  The  latter  usage  is  a  modification  of  the  existing 
convention  [58],  to  facilitate  simultaneous  discussions  of  yeast, 
mouse,  and  human  proteins.  In  some  citations,  the  abbreviated 
name  of  a  species  precedes  the  gene’s  name.) 

The  N-end  rule  has  a  hierarchic  structure.  In  the  yeast  S. 
cerevisiae,  Asn  and  Gin  are  tertiaiy  destabilizing  N-terminal 
residues  in  that  they  function  through  their  conversion,  by  the 
A7M7 -encoded  N-terminal  amidohydrolase  (Nt-amidase),  into 
the  secondary  destabilizing  N-terminal  residues  Asp  and  Glu 
(6).  Destabilizing  activity  of  N-terminal  Asp  and  Glu  requires 
their  conjugation,  by  the  5.  cerevisiae  -encoded  Arg- 
tRNA  protein  transferase  (R-transferase)  (8,  41),  to  Arg,  one 
of  the  primary  destabilizing  residues  (Fig.  lA).  In  mammals, 
the  deamidation  step  is  mediated  by  two  Nt-amidases,  Nt*^- 
amidase  and  Nt^-amidase,  which  are  specific,  respectively,  for 
N-terminal  Asn  and  Gin  (Fig.  lA)  (24,  59).  The  mammalian 
counterpart  of  the  yeast  R-transferase  Ate  Ip  exists  as  two 
distinct  species,  ATEl-lp  and  ATElp-2,  which  are  produced 
through  alternative  splicing  oiAtel  pre-mRNA  (34).  Inverte¬ 
brates,  the  set  of  secondary  destabilizing  residues  contains  not 
only  Asp  and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in 
yeast  (Fig.  lA)  (18,  23).  The  primary  destabilizing  N-terminal 
residues  are  bound  directly  by  the  UBRi -encoded  N-recognin, 
the  targeting  (E3)  component  of  the  N-end  rule  pathway.  In  S. 
cerevisiae,  Ubrlp  is  a  225-kDa  protein  which  recognizes  poten¬ 
tial  N-end  rule  substrates  through  its  type  1  and  type  2  sub- 
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no.  1.  Deletion-disruption  of  the  mouse  Ntani  gene.  (A)  Comparison  of  enzymatic  reactions  that  underlie  the  activity  of  tertiary  and  secondary  destabilizing 
residues  in  the  yeast  S,  cerevisiae  and  the  mouse.  N-terminal  residues  are  indicated  by  single-letter  abbreviations  for  amino  acids.  The  ovals  denote  the  rest  of  a  protein 
substrate.  The  /Vrani -encoded  mammalian  Nt^-amidase  converts  N-terminal  Asn  to  Asp.  N-terminal  Gin  is  deamidated  by  Nt*^-amidase,  which  remains  to  be  isolated 
(see  text).  In  contrast,  the  yeast  Nt-amidase  Ntalp  can  deamidate  either  N-terminal  Asn  or  Gin  (6).  The  secondary  destabilizing  residues  Asp  and  Glu  are  arginylated 
by  the  mammalian  ATEl-lp  or  ATEl-2p  R-transferase  (34).  A  Cys-specific  mammalian  R-transferase  (23)  remains  to  be  identified.  N-terminal  Arg,  one  of  the  primary 
destabilizing  residues,  is  recognized  by  N-recognin,  the  E3  component  of  the  N-end  rule  pathway  (63).  (B)  Targeting  strategy.  Top,  partial  restriction  map  of  the  mouse 
Ntani  gene;  middle,  structure  of  the  targeting  vector;  bottom,  structure  of  the  deletion-disruption  Mani'  allele.  Exons  are  denoted  by  solid  vertical  bars.  The  directions 
of  transcription  of  the  neo  and  tk  genes  are  indicated.  Homologous  recombination  resulted  in  the  replacement  of  the  Ntani  exons  2  to  5  with  the  neo  cassette.  Probes 
for  Southern  hybridization  are  indicated  by  solid  rectangles.  Restriction  sites:  Xh,  Xho\\  R,  EcoRI;  Bl,  BarnHV,  H,  ifmdIII;  P,  Pstl.  (C)  Southern  analysis  of 
BamHI-digested  tail  DNA  from  wild-type  (+/+),  heterozygous  {Ntanl^^-\  and  Ntanl~^-  mice.  The  5'  probe  yielded  the  12-  and  1.7-kb  Ntani  fragments  for  the 
wild-type  (wt)  and  mutant  (mut)  Ntani  alleles,  respectively;  the  3'  probe  detected  12-  and  7-kb  fragments.  The  organization  of  the  deletion-disruption  allele  was 
independently  verified  by  Southern  analysis  of  the  A/ioI-//wdIII-digested  tail  DNA  (data  not  shown).  (D)  PCR  analysis  of  tail  DNA.  The  primers  were  5'-GCCAC 
TTGTGTAGCGCCAAGTGCCAGC  (for  neo,  forward),  5'-CTTCCCACCAAGCCTGACTGTTGATC  (for  Ntani,  forward)  and  S'-CTTCAATTTCTGTGCTCAG 
CTAAGCTC  (for  Ntani,  reverse).  (E)  RT-PCR  analysis  of  the  total  RNA  isolated  from  +/+  and  Ntani EF  ceils,  using  primers  PI  (for  exon  1),  P2  (exon  2),  P3 
(exon  6),  P4  (exon  5),  and  P5  (exon  10).  p-Actin  mRNA  was  used  as  a  control,  at  the  20-fold-lower  primer  concentration  in  comparison  to  other  lanes. 
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strate-binding  sites.  The  type  1  site  binds  the  basic  N-terminal 
residues  Arg,  Lys,  and  His.  The  type  2  site  binds  the  bulky 
hydrophobic  N-terminal  terminal  residues  Phe,  Leu,  Trp,  Tyr, 
and  He  (35,  63).  Ubrlp  contains  yet  another  substrate-binding 
site  that  targets  proteins  such  as  Cup9p  and  Gpalp,  which  bear 
internal  (non-N-terminal)  degrons  (12,  54).  The  Ubrl  genes 
encoding  mouse  and  human  N-recognins,  also  called  E3a, 
have  been  cloned  (36),  and  mouse  strains  lacking  Ubrl  have 
recently  been  constructed  (Y.  T.  Kwon  and  A.  Varshavsky, 
unpublished  data). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  S.  cerevisiae,  through  the  degra¬ 
dation  of  Cup9p,  a  transcriptional  repressor  of  PTR2,  which 
encodes  the  peptide  transporter  (1,  12);  a  mechanistically  un¬ 
defined  role  in  regulating  the  Slnlp-dependent  phosphoryla¬ 
tion  cascade  that  mediates  osmoregulation  in  S.  cerevisiae  (47); 
the  degradation  of  alphaviral  RNA  polymerases  and  other 
viral  proteins  in  infected  metazoan  cells  (19,  38);  and  the 
degradation  of  Gpalp,  a  Ga  protein  of  5.  cerevisiae  (43,  54). 
Physiological  N-end  rule  substrates  were  also  identified  among 
the  proteins  secreted  into  the  mammalian  cell’s  cytosol  by 
intracellular  parasites  such  as  the  bacterium  Listeria  monocy¬ 
togenes.  Short  half-lives  of  these  bacterial  proteins  are  required 
for  the  efficient  presentation  of  their  peptides  to  the  immune 
system  (56).  Inhibition  of  the  N-end  rule  pathway  was  reported 
to  interfere  with  mammalian  cell  differentiation  (28)  and  to 
delay  limb  regeneration  in  amphibians  (61).  Studies  of  the 
Ub-dependent  proteolysis  of  endogenous  proteins  in  muscle 
extracts  suggested  that  the  N-end  rule  pathway  plays  a  role  in 
catabolic  states  that  result  in  muscle  atrophy  (39,  57).  A  crush 
injury  to  the  rat  sciatic  nerve  was  reported  to  result  in  a  '-'10- 
fold  increase  in  the  rate  of  arginine  conjugation  to  the  N 
termini  of  proteins  in  the  nerve’s  region  upstream  of  the  crush 
site,  suggesting  an  injury-induced  increase  in  the  concentration 
of  R-transferase  substrates  and/or  an  enhanced  activity  of  the 
N-end  rule  pathway  (65). 

Physiological  substrates  of  either  yeast  or  metazoan  Nt-ami- 
dases  and  R-transferases  are  unknown.  Engineered  N-end  rule 
substrates,  including  substrates  of  Nt-amidases  and  R-trans¬ 
ferases,  can  be  produced  in  vivo  through  the  Ub  fusion  tech¬ 
nique,  in  which  a  Ub-X  reporter  fusion  is  cleaved  by  deubiq- 
uitylating  enzymes  (DUBs)  (66)  after  the  last  residue  of  Ub, 
yielding  a  reporter  bearing  the  desired  N-terminal  residue  X 
(3,  63). 

The  mouse  Asn-specific  Nt  -amidase  is  encoded  by  the 
17-kb  Ntanl  gene.  The  1.6-kb  Ntanl  mRNA  specifies  the  310- 
residue  Nt’^-amidase  (24).  In  the  present  work,  we  character¬ 
ized  the  expression  and  intracellular  localization  of  Nt^-ami- 
dase.  We  also  constructed  mouse  strains  bearing  a  homozygous 
deletion-disruption  of  Ntanl  and  showed  that  these  mice 
lacked  both  Nt^-amidase  and  the  Asn-specific  branch  of  the 
N-end  rule  pathway.  The  Ntanl~'~  mice  were  fertile  and  out¬ 
wardly  normal  but  were  found  to  differ  from  their  congenic 
wild-t^e  counterparts  in  spontaneous  activity  and  spatial 
memory.  Among  these  differences  was  a  socially  conditioned 
exploratory  phenotype  of  Ntanl~'~  mice  that  has  not  been 
previously  described,  to  our  knowledge,  with  other  mouse 
strains. 

MATERIALS  AND  METHODS 

Construction  of  mouse  strains  lacking  Nt^-amidase.  Genomic  Ntanl  DNA 
fragments  were  subcloned  from  the  PI  phage  DNA  containing  the  strain  C129- 
derived  mouse  Ntanl  gene  (24).  The  1,462-bp  /fmdlll-jPjrt  fragment  containing 
exon  6  and  its  flanking  introns  was  used  as  the  vector’s  short  homology  arm.  The 
7.2-kb  Xhdl-Notl  fragment  containing  exon  1  and  its  flanking  intron  was  used  as 
the  long  homology  arm.  To  construct  the  targeting  vector,  a  1,462-bp 
HindlU-Pstl  fragment  containing  exon  6  and  its  flanking  introns  (the  vector’s 


short  arm)  and  the  phosphoglycerate  kinase  gene  (PGX)-thymidine  kinase  (tk) 
cassette  of  pPNT  (a  gift  from  R.  C.  Mulligan,  Harvard  Medical  School,  Boston, 
Mass.)  were  inserted  into  pPGKRN  containing  the  wild-type  PGK-neo  (neomy¬ 
cin  resistance  gene)  cassette  (a  gift  from  R.  Jaenisch,  Whitehead  Institute, 
Cambridge,  Mass.),  yielding  pPGK-SA.  The  vector’s  7.2-kb  long  arm  was  pro¬ 
duced  by  joining,  within  pPGK-SA,  a  4.8-kb  Xhol-EcoKl  fragment  to  its  flanking 
2.6-kb  EcoRI-L'coRI  fragment  containing  exon  1  and  flanking  intron  pPGK-SA, 
yielding  the  Ntanl  targeting  vector  pNTANl-KO  (Fig.  IB).  The  J^ol-linearized 
targeting  vector  (Fig.  IB)  was  electroporated  into  CJ7  embiyonic  stem  (ES)  cells. 
Selection  with  G418  (at  0.4  mg/ml)  and  l-(2'-deoxy,  T  fluoro-p-D-arabino- 
furanosyl)-5-iodouracil  (FIAU;  at  0.4  jxM)  was  started  24  h  after  electroporation. 
Correctly  targeted  ES  cells  were  identified  by  PCR  and  Southern  hybridization 
with  the  5'  and  3'  probes  (Fig.  1C  and  D).  Cells  of  the  12  independent  ES  cell 
clones  were  injected  into  3.5-days-postcoitum  C57BL/6J  blastocysts.  The  result¬ 
ing  male  chimeras  were  bred  with  C57BL/6J  females  to  test  for  germ  line 
transmission  of  the  mutated  Ntanl  gene.  The  NtanU'~  mice  resulting  from  this 
cross  (6  out  of  12  independent  ES  clones  were  found  to  populate  germ  line  in 
these  tests)  were  intercrossed  to  produce  Ntanl~^~  mice.  Alternatively,  the 
initial  male  chimeras  were  mated  with  129/SvEv  females,  yielding,  through  the 
analogous  series  of  steps,  Ntanl~^~  mice  in  the  strain  129  background.  All  of  the 
behavioral  tests  were  carried  out  with  Ntanl~^~  mice  in  the  strain  129  back¬ 
ground.  For  genotyping,  the  tail-derived  DNA  was  analyzed  by  PCR,  or  digested 
with  either  RamHI  or  HindllUXhol,  and  analyzed  by  Southern  hybridization.  A 
0.75-kb  Pstl  fragment  and  a  0.9-kb  Pstl  fragment  (indicated  in  Fig.  IB)  were  used 
as  the  5'  and  3'  hybridization  probes,  respectively. 

Northern  and  RT-PCR  analyses  of  RNA.  Total  RNA  was  isolated  from  brain, 
testis,  and  embryonic  fibroblasts  (EF  cells)  of  wild-type  and  Ntanl  mice  as 
described  elsewhere  (34).  RNA  was  fractionated  by  electrophoresis  in  1%  form- 
aldehyde-agarose  gels,  blotted  onto  Hybond  N'*'  (Amersham),  and  hybridized 
with  ^^P-labeled  probes  specific  for  different  regions  of  the  Ntanl  cDNA  (Gen- 
Bank  accession  no.  U57692):  probe  a,  nucleotide  (nt)  34  to  900;  probe  b,  nt  118 
to  450;  probe  c,  nt  118  to  900;  probe  i  nt  34  to  450;  probe  e,  nt  470  to  670;  and 
probe  f,  nt  680  to  900.  Alternatively,  reverse  transcription-PCR  (RT-PCR)  was 
carried  out  (2),  with  first-strand  cDNA  synthesized  using  Superscript  II  poly¬ 
merase  (GIBCO,  Frederick,  Md.).  PCR  was  done  using  primers  specific  for 
different  regions  of  the  Ntanl  cDNA:  primer  PI,  5'-ATGCCACTGCTGGTGG 
ATGG-GCAG  (forward);  P2,  5'-GAGCCAGACITCTCAGAGGTCAG  (for¬ 
ward);  P3,  5'-GACA-TTCACITAGTGACATrATG  (forward);  P4,  5'-TTCTG 
TGACAGCTGCCTGTCATC  (reverse);  and  P5,  5'-CATCAAGGTAGATCTA 
ATATGTTC  (reverse).  The  ratio  of  mouse  Atel-1  and  Atel-2  mRNAs  was 
determined  as  described  elsewhere  (34). 

Whole-mount  in  situ  hybridization.  Wild-type  and  Ntanl~^~  embryos  were 
staged,  fixed,  and  processed  for  in  situ  hybridization  as  described  elsewhere  (16). 
A  0.3-kb  fragment  of  the  Ntanl  cDNA  (nt  108  to  448)  that  was  encompassed  by 
the  deleted  region  in  the  Ntanl~^~  allele  was  subcloned  mto  Xba\-Xho\  sites  of 
pBluescript  11 SK"^  (Stratagene),  and  the  resulting  plasmid,  pMR27,  was  used  as 
a  template  for  synthesizing  antisense  RNA  probe  labeled  with  digoxigenin 
(Roche  Molecular  Biochemicals,  Indianapolis,  Ind.). 

Localization  of  NTANlp-GFP.  The  mouse  Ntanl  open  reading  frame  (ORF) 
was  subcloned  into  the  Xhdl  and  Agel  sites  of  pEGFP-Nl  (Clontech),  yielding 
pNTANl-GFP,  which  expressed  the  NTANlp  green  fluorescent  protein  (GFP) 
fusion  from  the  cytomegalovirus  promoter.  NIH  3T3  cells  (ATCC  1658-CRL) 
were  grown  as  monolayers  in  Dulbecco’s  modified  Eagle  medium  (DMEM; 
Gibco-BRL)  supplemented  with  10%  fetal  bovine  serum.  Cells  were  grown  to 
—15%  confluence  on  glass  coverslips  for  24  h  prior  to  transfection  with  either  the 
GFP-expressing  control  vector  pEGFP-Nl  or  pNTANl-GFP,  using  Lipo- 
fectamine  (GIBCO)  and  the  manufacturer-supplied  protocol.  Cells  were  incu¬ 
bated  for  5  h  at  37°C  in  serum-free  DMEM  containing  DNA  and  Lipofectamine. 
An  equal  volume  of  medium  containing  20%  serum  was  then  added,  and  the  cells 
were  grown  for  another  12  to  20  h  at  37‘’C.  Cells  were  fixed  with  2%  formalde¬ 
hyde  in  phosphate-buffered  saline,  and  GFP  fluorescence  was  examined  in  a 
Zeiss  Axiophot  microscope. 

Immortalization  of  EF  cells,  transfection,  and  pulse-chase  analysis.  EF  cells 
were  isolated  from  either  Ntanl~^~  or  +/+  13.5-day-old  (el3.5)  embryos  as 
described  elsewhere  (52)  and  grown  in  DMEM-F-12  (GIBCO)  supplemented 
with  15%  fetal  bovine  serum,  antibiotics,  and  2  mM  L-glutamine.  They  were 
immortalized  using  simian  virus  40  (14)  and  then  transiently  transfected,  using 
Lipofectamine  PLUS  (GIBCO),  with  plasmid  pRC/dhaUbXnsP4pgal,  which  ex¬ 
pressed  DHFR  (dihydrofolate  reductase)-HA  (hemagglutinin)-Ub^'*^-X- 
nsP4(3gal  test  proteins  (X  =  Met,  Asn,  Gin,  or  Arg)  (40)  (see  Results).  Cells  were 
labeled  with  Tran^^S-label  (New  England  Nuclear,  Boston,  Mass.),  followed  by 
a  chase  for  0, 1,  and  2  h  in  the  presence  of  cycloheximide,  preparation  of  extracts, 
immunoprecipitation,  sodium  dodecyl  sulfate-polyacrylamide  gel  electrophoresis 
(SDS-PAGE),  autoradiography,  and  quantitation  with  Phosphorlmager,  essen¬ 
tially  as  described  elsewhere  (40). 

X-DHFR  test  proteins.  ^^S-labeled  X-DHFR  proteins  (X  =  Asn,  Gin,  or  Asp) 
were  prepared  as  described  elsewhere  (24).  Briefly,  plasmids  pSG4,  pSG41,  and 
pSG44,  expressing,  respectively,  Ub-Asn-DHFR,  Ub-Asp-DHFR,  and  Ub-Gln- 
DHFR  from  the  P^rc  promoter  (24),  were  transformed  into  Escherichia  coli 
JMlOl  carrying  pJT184,  which  expressed  Ubplp,  a  DUB  of  S.  cerevisiae  (62). 
Cells  of  a  50-ml  culture  grown  at  3TC  to  an  A^qq  of  ~0.9  in  Luria  broth  plus 
ampicillin  (40  |xg/ml)  and  chloramphenicol  (20  |JLg/ml)  were  pelleted  and  resus- 
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pended  in  50  ml  of  M9  supplemented  with  glucose  (0.2%),  thiamine  (2  |xg/ml), 
ampicillin  (40  jig/ml),  1  mM  isopropylthio-p-D-galactoside  (IPTG),  and  methi¬ 
onine  assay  medium  (Difco).  The  suspension  was  shaken  for  1  h  at  37°C,  fol¬ 
lowed  by  the  addition  of  1  mCi  of  Tran^^S-label  (ICN)  and  further  incubation  for 
1  h  at  37°C.  Unlabeled  L-methionine  was  then  added  to  1  mM,  and  shaking  was 
continued  for  another  10  min.  The  cells  were  harvested,  and  lysates  were  pre¬ 
pared  by  the  addition  of  lysozyme  and  Triton  X-100,  followed  by  centrifugation 
(24).  The  supernatant,  containing  [^^S]X-DHFR  (X  =  Asn,  Asp,  or  Glu)  was 
purified  by  affinity  chromatography  on  a  methotrexate  column  (Pierce)  (0.5-ml 
bed  volume).  p^S]X-DHFRs  were  examined  by  SDS-PAGE  and  found  to  be 
greater  than  95%  pure. 

lEF  assay  for  amidase  activity.  Extracts  from  wild-type  or  NtanJ~^~  EF  cells 
were  prepared  by  homogenization  in  a  mixture  containing  0.01%  Triton  X-100, 
10%  glycerol,  125  mM  KCl,  7.5  mM  MgCl2,  1  mM  dithiothreitol,  0,25  mM 
EDTA,  0.2  mM  phenylmethylsulfonyl  fluoride,  130  mM  Tris-HCl  (pH  7.5),  and 
the  protease  inhibitors  antipain,  chymostatin,  leupeptin,  pepstatin,  and  aprotinin 
(Sigma),  each  at  25  jxg/ml.  For  the  deamidation  assay,  5  )xl  of  ^^S-labeled 
X-DHFR  (0.5  mg/ml  in  storage  buffer;  X  =  Asn,  Asp,  or  Glu)  was  mixed  with  20 
(xl  of  the  extract,  incubated  for  2  h  at  37°C,  and  placed  on  ice.  Samples  (5  |jl1) 
were  applied  onto  isoelectrofocusing  (lEF)-PAGE  plates  (pH  3.5  to  9.5;  Phar¬ 
macia,  Piscataway,  N.J.)  precooled  to  10°C.  lEF  was  carried  out  for  80  min  at  30 
W  in  a  cooled  lEF  apparatus  (Hoefer,  San  Francisco,  Calif.).  The  plates  were 
soaked  in  100  ml  of  10%  CCl3COOH-5%  5-sulfosalicylic  acid,  stained  with 
Coomassie  blue  to  detect  lEF  markers  (Pharmacia),  and  autoradiographed.  ^“^S 
in  the  bands  of  more  acidic  (deamidated)  and  more  basic  (initial)  X-DHFR 
species  was  quantitated  with  a  Phosphorlmager  (Molecular  Dynamics,  Sunny¬ 
vale,  Calif.). 

Behavioral  tests.  Mice  were  kept  on  a  0600-  to  1800-h  light  cycle  and  tested 
during  the  light  period.  For  the  rotarod,  weight  retention,  coat  hanger,  and 
platform-leaving  tests,  strain  129  Ntanl~^~  mice  and  their  congenic  +/+  litter- 
mates  (produced  through  matings  of  Ntanl~‘'^  mice)  were  used.  Nonlittermates 
were  also  used  for  the  platform-leaving  test  (see  below).  For  the  shuttlebox  and 
passive  avoidance  tests,  the  elevated  plus-maze  test,  the  open-field  test,  the 
Morris  maze,  the  radial  arm  maze,  and  the  Lashley  maze,  strain  129  +/+  and 
congenic  Ntanl~'~  mice  (produced,  respectively,  through  4-/-I-  X  and 
Ntanl~^~  X  Ntanl~'~  matings)  were  used.  The  same  series  of  tests  were  re¬ 
peated  6  weeks  or  more  after  the  initial  experiment,  using  both  male  and  female 
mice.  To  determine  whether  the  Ntanl  genotype  affected  long-term  memory, 
mice  were  retested  in  the  Morris  maze,  Lashley  maze,  and  shuttlebox  avoidance 
test  at  least  7  weeks  after  their  original  learning.  The  data  were  evaluated  by 
analysis  of  variance.  Significance  was  set  at  the  0.05  level;  all  behavioral  differ¬ 
ences  described  in  Results  were  at  this  or  a  higher  level  of  statistical  significance. 

(i)  Rotarod  test.  The  apparatus  (UGO,  Basile,  Italy)  consisted  of  a  motor- 
driven  rod  3  cm  in  diameter  that  carried  five  compartments,  separated  by  walls 
25  cm  in  height.  One  pair  of  littermates  was  placed  on  the  rotating  rod  in  each 
compartment.  Mice  were  placed  on  the  rod  at  either  10  or  20  rpm;  the  cutoff  time 
was  2  min.  Each  mouse  was  tested  on  the  rod  sue  times,  at  1-h  intervals. 

(ii)  Weight  retention.  Mice  were  suspended  by  the  tails  and  made  to  grab  a 
wire  loop  (wire  diameter,  2  mm;  weight,  40  g)  70  cm  above  the  floor.  Time 
elapsed  before  the  animal  released  the  loop  was  measured.  Each  mouse  was 
tested  five  times  a  day,  at  1-h  intervals,  for  a  total  of  3  days. 

(iii)  Coat  hanger  test.  A  wire  coat  hanger  (wire  diameter,  2  mm;  length  of  side 
bars,  29  cm;  length  of  the  horizontal  bar,  45  cm)  suspended  75  cm  above  the  table 
was  used  to  examine  motor  coordination  (11).  A  trial  began  by  allowing  the 
mouse  to  grab  the  middle  of  the  horizontal  bar  with  two  front  paws.  The  time 
elapsed  before  the  animal  grabbed  the  horizontal  bar  with  all  four  paws  and  the 
time  before  the  two  front  paws  reached  one  of  the  side  bars  were  determined. 
Each  mouse  in  the  trial  was  tested  twice  a  day,  at  6-h  intervals,  for  9  days. 

(iv)  Shuttlebox  avoidance,  passive  avoidance,  and  elevated  plus-maze  tests. 
For  the  shuttlebox  and  passive  avoidance  tests,  the  Gemini  avoidance  system 
(San  Diego  Instruments,  San  Diego,  Calif.)  was  used.  The  procedures  are  de¬ 
scribed  in  Results. 

(v)  Open-field  test.  For  the  open-field  activity  measurements,  an  Omnitech 
Digiscan  animal  activity  monitor  was  used.  Each  mouse  was  placed  into  a  square 
chamber  (20  by  20  cm)  for  one  9-min  session,  and  the  movements  were  tabulated 
along  the.x  andy  axes,  using  infrared  sensors. 

(vi)  Morris  maze.  A  mouse  was  placed  into  a  circular  black  water  tub  (diam¬ 
eter,  123  cm;  temperature,  19  to  21'’C)  from  one  of  the  four  quadrants  and  had 
to  find  a  platform  (diameter,  23  cm)  submerged  1  cm  below  the  surface  of  water 
and  set  23  cm  from  the  wall.  There  were  no  cues  within  the  maze,  and  so  the 
animal  had  to  use  extramaze  spatial  cues  in  guiding  itself  to  the  platform  in 
repeated  trials  (46).  An  animal  was  given  four  trials  per  day  (maximum  of  45  s 
per  trial),  one  from  each  of  the  four  locations,  their  order  determined  quasi- 
randomly.  Each  mouse  was  tested  for  5  days,  with  the  platform  in  a  fixed  position. 
On  day  6,  the  platform  was  moved  to  the  quadrant  diagonally  opposite  its 
previous  location,  and  each  mouse  was  given  four  trials,  as  before.  This  regimen 
is  referred  to  as  reversal  learning.  The  mice  were  retested  (with  the  platform  in 
the  original  position)  7  or  8  weeks  later.  The  animal’s  path  was  traced  on  an 
electronic  digitizing  tablet  containing  a  template  of  the  maze  (22).  The  following 
parameters  were  recorded:  total  time,  distance  traveled,  average  speed,  and 
percentage  of  time  spent  in  each  quadrant  of  the  maze. 


(vii)  Spatial  radial  arm  maze.  The  apparatus  and  testing  procedure  were 
described  previously  (30).  Hidden  escape  platforms  were  placed  at  the  ends  of 
four  of  the  eight  identical  arms  which  radiated  from  the  central  area.  During 
training  the  animal  was  released  from  a  start  arm  and  had  to  find  one  of  the 
escape  platforms.  Once  a  platform  was  found,  the  arm  containing  that  platform 
was  blocked.  The  training  session  consisted  of  four  trials  in  which  a  mouse  had 
to  locate  all  four  platforms.  This  was  followed  by  11  testing  sessions  which 
differed  from  the  training  session  only  by  removing  the  platform  while  the  mouse 
was  in  its  home  cage  between  trials.  Thus,  on  successive  trials,  the  animal  had  to 
remember  which  alleys  it  had  previously  entered  and  not  enter  those  alleys  again. 
The  score  was  total  errors  obtained  by  summing  together  Working  Memory 
Correct  errors.  Working  Memory  Incorrect  errors,  and  Reference  Memory  er¬ 
rors  (30). 

(ix)  Nonspatial  radial  arm  maze  learning.  The  nonspatial  swimming  radial 
arm  maze  apparatus  and  testing  procedure  were  similar  to  the  above  spatial 
version,  except  for  the  following.  Each  of  the  arms  was  painted  in  a  different 
black/white  pattern:  solid  black,  solid  white,  vertical  stripes,  horizontal  stripes, 
black  with  white  polka  dots,  white  with  black  polka  dots,  zigzags,  and  the  galva¬ 
nized  steel  gray  color  (the  start  arm).  The  patterned  arms  which  contained 
platforms  were  different  for  each  mouse  but  remained  constant  throughout  the 
testing  of  a  given  mouse.  For  the  training  session  (session  1),  the  entrance  to  the 
arm  with  the  just  chosen  platform  was  blocked;  in  addition,  the  maze  was 
quasi-randomly  rotated,  so  that  each  patterned  arm  now  pointed  toward  a  dif¬ 
ferent  place  in  space.  As  in  the  spatial  version,  the  test  sessions  (sessions  2  to  12) 
were  the  same  as  the  training  session,  except  that  while  the  mouse  was  in  its 
home  cage,  each  platform  it  found  was  removed  from  the  maze,  and  the  arm’s 
entrance  remained  open  for  the  remainder  of  the  session.  Over  eight  trials,  or 
two  sessions,  each  patterned  arm  pointed  toward  each  of  the  eight  spatial  loca¬ 
tions  in  the  testing  room.  The  errors  were  scored  as  above. 

(x)  Lashley  maze  learning.  This  maze  contained  cul-de-sacs  that  an  animal  had 
to  learn  to  avoid,  along  with  choices  that  required  the  animal  to  learn  making 
correct  left  or  right  turns  (T-choices).  A  water  version  of  the  maze  was  used,  with 
the  temperature  at  19  to  20°C  (21),  All  mice  were  given  one  trial  per  day  for  5 
days.  An  animal  can  learn  this  maze  using  two  mutually  nonexclusive  strategies: 
by  memorizing  extramaze  spatial  cues,  or  by  memorizing  the  sequence  of  correct 
left  and  right  turns.  The  measures  of  learning  included  the  learning  index, 
defined  as  the  number  of  correct  entries  divided  by  total  number  of  entries,  the 
number  of  cul  entries  (entries  into  dead  ends)  when  swimming  forward  (i.e., 
toward  the  goal),  number  of  forward  T-choice  errors,  and  the  total  number  of 
backward  errors.  Retention  testing  was  done  7  weeks  later  for  experiment  1  and 
8  weeks  later  for  experiment  2. 

(xi)  Platform-leaving  test.  To  compare  the  exploratory  activities  of  two  pre¬ 
viously  untested  mice  on  the  same  platform,  we  devised  the  platform-leaving  test. 
One  Ntanl  mouse  of  strain  129  and  one  congenic  +/+  mouse  of  the  same 
gender  (either  a  littermate  or  a  nonlittermate)  were  placed,  at  the  same  time,  on 
the  platform  (16  by  22  cm)  2.7  cm  above  the  surface  of  a  laboratory  bench.  For 
each  test,  the  platform  was  covered  with  fresh  white  paper.  Mice  were  allowed  to 
either  explore  or  step  down  from  the  platform  up  until  the  cutoff  time  of  3  min. 
(Step-down  was  recorded  when  all  four  paws  of  a  mouse  were  no  longer  in 
contact  with  platform.)  The  number  and  genotypes  of  mice  leaving  the  platform 
first,  and  the  number  of  mice  not  leaving  by  3  min,  were  recorded.  A  total  of 
three  trials,  at  1-h  intervals,  were  carried  out  for  every  pair  of  mice  tested  on  a 
given  day.  Each  set  of  tests  employed  —10  pairs  of  mice  2  to  4  months  old.  Four 
independent  sets  of  tests,  with  previously  untested  mice,  were  carried  out  over  a 
period  of  6  months. 

RESULTS 

Construction  of  mice.  A  deletion-disruption  allele 

of  the  ~17-kb  mouse  Ntanl  gene  was  constructed  by  replacing 
Ntanl  exons  2  to  5  with  a  neo  cassette  (Fig.  IB).  Exons  2  to  5 
encoded  a  118-residue  region  of  the  310-residue  NTANlp 
(Nt^-amidase)  that  was  significantly  conserved  among  the 
NTANlp  proteins  of  different  species,  including  the  plant  ^ra- 
bidopsis  thaliana,  the  fly  Drosophila  melanogaster,  and  mam¬ 
mals  (data  not  shown).  Of  the  —1,000  ES  cell  clones  (strain 
129/CJ7)  resistant  to  both  G418  and  FIAU,  33  clones  con¬ 
tained  the  expected  mutation,  as  verified  by  PCR  and  Southern 
analyses  (data  not  shown).  Twelve  of  these  correctly  targeted 
ES  cell  clones  were  used  to  generate  male  chimeras,  and  in  six 
of  them  the  Ntanl~  allele  was  transmitted  through  the  germ 
line.  Male  chimeras  were  mated  with  either  129/SvEv  or 
C57BL/6  females,  yielding  Ntanl^^~  heterozygotes.  Inter¬ 
crosses  of  heterozygous  mice  produced  Ntanl~‘~  progeny  at 
the  expected  frequency  of  approximately  one  in  four  (Fig.  1C 
and  D),  indicating  that  the  absence  of  NTANlp  did  not  in¬ 
crease  embryonic  lethality.  The  behavioral  tests  described  be- 
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FIG.  2.  Northern  hybridization  analysis  of  the  total  RNA  isolated  from  +/ + 
and  Ntanl~^~  brains  and  testes.  (A)  Hybridization  using  probes  a  to  f  that 
encompassed  different  regions  of  the  Ntanl  cDNA  (indicated  in  panel  C).  (B) 
The  same  Northern  blots  were  hybridized  with  probes  specific,  respectively,  for 
the  mouse  Ubrl,  Atel,  and  p-actin  cDNAs.  (C)  Exons  of  Ntanl  and  the  hybrid¬ 
ization  probes  used.  The  sequence  of  a  208-bp  segment  of  the  Ntanl  cDNA  (nt 
896  to  930),  termed  the  IL-2  homology  region,  is  98.6%  identical  to  the  sequence 
of  a  206-bp  segment  in  the  3 '-flanking  untranslated  region  of  the  mouse  112  gene, 
which  encodes  IL-2  (24). 
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low  used  exclusively  Ntanl  mice  produced  through  matings 
in  the  strain  129  background. 

Expression  of  Ntanl  and  intracellular  localization  of 
NTANlp.  Several  regions  of  Ntanl  transcripts  were  analyzed 
using  RT-PCR  with  RNA  from  wild-type  and  Ntanl~^~  EF 
cells.  The  level  of  Man7-derived  RNA  in  the  Ntanl~^~  EF 
cells  was  below  the  RT-PCR  detection  threshold  (Fig.  IE  and 
data  not  shown).  Northern  hybridization  analyses,  using  probes 
encompassing  either  exons  2  to  5  (the  deleted  region)  or  exons 
1  to  5  detected  the  1.4-  and  1.6-kb  Ntanl  mRNAs,  respectively, 
in  the  brains  and  testes  of  +/+  mice;  the  +/+  testes  also 
contained  the  Mfl/i7-derived  RNAs  of  1.1  and  4  kb  (Fig.  2A). 
No  Ntanl  transcripts  were  detected  in  the  brains  and  testes  of 
Ntanl  mice  with  these  probes  (Fig.  2Ab  and  d). 

Similar  analyses  but  using  probes  encompassing  exons  1  to 
10  or  2  to  10  (except  for  the  interleukin-2  [IL-2]  similarity 
region  of  exon  10  [24])  detected  at  most  trace  amounts  of 
Ntanl -dQhwod  transcripts  in  the  brains  oiNtanl~^~  mice  (Fig. 
2Aa  and  c).  However,  in  the  testes  of  Ntanl~^~  mice,  the  same 
probes  detected  1.1-  and  4-kb  Mani-derived,  testis-specific 
transcripts  but  not  the  1.6-kb  transcript  (Fig.  2Aa  and  c).  Fur¬ 
ther  mapping,  using  the  probes  shown  in  Fig.  2C,  indicated  that 
the  1.1-  and  4-kb  transcripts  hybridized  to  Ntanl  exons  6  to  10 


and  exons  9  and  10,  respectively  (Fig.  2Ae  and  f).  These  tran¬ 
scripts  were  detected  with  antisense  but  not  sense  probes  (data 
not  shown),  indicating  a  direction  of  transcription  identical  to 
that  for  the  full-length  Ntanl  mRNA.  Thus,  the  testes  but  not 
the  brains  of  Ntanl~^~  mice  contained  a  set  of  Man7-derived 
transcripts  that  lacked  the  5'  half  of  the  Ntanl  ORF  (including 
the  region  deleted  in  Ntanl  mice)  and  encompassed  its  3' 
half.  Apparently  the  same  transcripts  were  present  in  the  +/+ 
testes  (Fig.  2Ae  and  f).  Whether  these  testis-specific  transcripts 
are  physiologically  relevant  remains  to  be  determined. 

We  also  examined  by  Northern  analysis  whether  the  expres¬ 
sion  of  other  components  of  the  N-end  rule  pathway  was  al¬ 
tered  in  the  Ntanl''^^  mice.  No  changes  in  the  expression  of 
either  Ubrl  (encoding  the  E3  of  the  N-end  rule  pathway)  or 
Atel  (encoding  R- transferase  [34];  see  the  introduction)  were 
detected  in  the  brains  and  testes  of  Ntanl~^~  mice  (Fig.  2B). 
The  expression  of  Ubr2  and  Ubr3  (homologs  of  Ubrl\  Kwon 
and  Varshavsky,  unpublished  data)  and  of  mHR6B,  encoding 
the  E2i4k  Ub-conjugating  (E2)  enzyme,  was  also  unchanged  in 
the  Ntanl~^~  mice  (data  not  shown).  In  addition,  no  alter¬ 
ations  in  the  ratio  of  Atel-1  mRNA  to  Ate  1-2  mRNA,  which 
encode  two  splicing-derived  forms  of  the  mouse  R-transferase 
(34),  was  detected  in  the  brains,  testes,  and  EF  cells  of 
Ntanl~^^  mice  (data  not  shown). 

Ntanl  was  expressed  in  most  if  not  all  tissues  of  adult  mice 
(24).  Whole-mount  in  situ  hybridization  was  used  to  examine 
the  expression  of  Ntanl  in  +/+  mouse  embryos.  In  e9.75  em¬ 
bryos,  Ntanl  was  most  strongly  expressed  in  the  branchial 
arches,  the  tail  bud,  and  the  forelimb  buds;  the  same  probe 
detected  no  signal  in  the  Ntanl~^~  embryos  (Fig.  3A).  In  elO.5 
and  ell.5  embryos,  the  expression  of  Ntanl  became  high  in  the 
hindlimb  buds  as  well;  this  expression  pattern  was  indistin¬ 
guishable  from  that  of  Ubrl  (Fig.  3A),  consistent  with  the 
assignment  of  NTANlp  and  UBRlp  to  the  same  pathway. 

To  determine  the  intracellular  location  of  NTANlp,  it  was 
expressed  in  mouse  3T3  cells  as  a  fusion  to  the  N  terminus  of 
GFP.  Whereas  the  free  26-kDa  GFP  was  distributed  uniformly 
throughout  the  cell,  the  65-kDa  NTANlp-GFP  was  enriehed  in 
the  nucleus,  was  also  present  in  the  nucleus-proximal  cyto¬ 
plasm,  but  was  apparently  much  less  abundant  at  the  cells’ 
periphery  (Fig.  3B  and  data  not  shown). 

Ntanl~^~  mice  lack  Nt^-amidase  and  the  asparagine-spe¬ 
cific  branch  of  the  N-end  rule  pathway.  To  determine  whether 
Ntanl~^~  EF  cells  lacked  Nt^-amidase  activity,  we  used  a 
previously  described  assay  (24).  The  ^^S-labeled  X-DHFR  test 
proteins  (X  =  Asn,  Gin,  or  Asp)  were  incubated  with  a  whole¬ 
cell  extract  and  thereafter  fractionated  by  lEF,  which  sepa¬ 
rated  Asn-DHFR  and  Gln-DHFR  from  Asp-DHFR  and  Glu- 
DHFR,  respectively.  Incubation  of  Asn-DHFR  with  the 
extract  from  +/+  EF  cells  shifted  the  pi  of  this  protein  to  that 
of  Asp-DHFR.  In  contrast,  the  pi  of  Asn-DHFR  remained 
unchanged  after  an  otherwise  identical  incubation  with  the 
same  amount  of  extract  from  Ntanl~^~  EF  cells  (Fig.  4A).  The 
pi  of  Gln-DHFR  shifted  to  that  of  Glu-DHFR  after  incuba¬ 
tions  with  either  +/+  or  Ntanl~^~  extracts  (Fig.  4A).  Thus, 
Ntanl~^^  EF  cells  lacked  Nt^-amidase  activity  but  contained 
wild-type  amounts  of  the  Gin-specific  Nt^-amidase.  The  latter 
result  indicated  that  the  putative  mammalian  Nt^-amidase 
(which  remains  to  be  isolated  and  cloned)  is  encoded  by  a  gene 
distinct  from  Ntanl. 

To  examine  the  in  vivo  degradation  of  N-end  rule  substrates 
in  the  absence  of  Nt’^-amidase,  the  immortalized  +/+  and 
Ntanl  EF  cell  lines  were  transiently  transfected  with  plas¬ 
mids  that  expressed  X-nsP4pgal  test  proteins  (X  =  Met,  Asn, 
Gin,  or  Arg).  An  X-nsP4Pgal  was  expressed  as  part  of  a  fusion 
containing  the  HA-tagged  reference  protein  DHFR.  In  this 
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FIG.  3.  Expression  of  Ntani  mRNA  and  localization  of  NTANlp.  (A)  Whole-mount  in  situ  hybridization  of  wild-type  (wt)  and  Ntanl~^~  embiyos  (left  four  panels, 
light  background)  with  an  antisense  RNA  probe  derived  from  a  0.3-kb  fragment  of  the  Ntani  cDNA  (nt  108  to  448)  that  was  absent  from  the  Ntanl~^~  allele.  The 
regions  of  high  Ntani  expression  in  the  tail  buds  (t),  forelimb  buds  (fl),  and  hindlimb  buds  (hi)  are  indicated.  The  right  two  panels  (dark  background)  show  the  results 
of  in  situ  hybridization  with  an  antisense  (AS)  Ubrl  cDNA  probe.  (B)  Intracellular  localization  of  mouse  NTANlp.  NIH  3T3  cells  were  transiently  transfected  with 
a  plasmid  expressing  the  NTANlp-GFP  fusion  (see  Materials  and  Methods).  Typical  fluorescence  patterns  (a,  c,  and  e)  and  the  matching  phase-contrast  images  (b 
d,  and  f)  are  shown  (see  text).  &  v  ’ 


previously  developed  UPR  (Ub  protein  reference)  technique 
(40,  60),  a  DHFR-HA-UbR48-X-nsP4pgal  fusion  is  cotransla- 
tionally  cleaved  by  DUBs  at  the  Ub>X-nsP4pgal  junction, 
yielding  the  long-lived  DHFR-HA-UbR48  reference  protein 
and  the  test  protein  X-nsP4pgal.  In  the  UPR  technique,  the 
reference  protein  serves  as  an  internal  control  for  the  levels  of 


expression,  immunoprecipitation  yields,  sample  volumes,  and 
other  sources  of  sample-to-sample  variation,  thereby  increas¬ 
ing  the  accuracy  of  pulse-chase  assays  (40,  60). 

The  previously  introduced  term  ID^  (initial  decay,  i.e.,  the 
extent  of  degradation  of  a  protein  during  the  pulse  of  jc  minutes 
[40])  was  used  to  describe  the  in  vivo  decay  curves  of  test 


VoL.  20,  2000 


MOUSE  N-TERMINAL  AMIDASE  4141 


FIG.  4.  Mouse  Ntanr^~  cells  lack  Nt^-amidase  and  are  unable  to  degrade  the  normally  short-lived  N-end  rule  substrates  bearing  N-terminal  Asn.  (A)  ^^S-labeled, 
purified  X-DHFR  test  proteins  (X  =  Asn,  Gin,  or  Asp)  were  incubated  for  2  h  at  37'’C  with  buffer  alone  (negative  controls  [NC])  or  with  extracts  from  either  wild-type 
or  Ntanl~^~  EF  cells,  followed  by  lEF  and  autoradiography.  The  assays  were  carried  using  either  the  initial  extracts  (lanes  a)  or  the  same  extracts  diluted  with  buffer 
10-,  100-,  and  1,000-fold  (lanes  b,  c,  and  d,  respectively).  The  lEF  positions  of  X-DHFRs  bearing  N-terminal  Asp  or  Glu  versus  Asn  or  Gin  are  shown  on  the  left.  The 
corresponding  pH  values  are  indicated  on  the  right.  (B)  Immortalized  +/+  and  Ntanr>-  EF  cells  (see  Materials  and  Methods)  were  transiently  transfected  with 
plasmid  pRC/dhalJbXnsP4pgal,  which  expressed  DHFR-HA-Ub^‘^-X-nsP4Pgal  test  proteins  (X  =  Met,  Asn,  Gin,  or  Arg)  (40).  These  proteins  were  cotranslationally 
cleaved  in  vivo  by  DUBs,  yielding  the  long-lived  reference  protein  DHFR-HA-Ub^‘*®  and  the  test  protein  X-nsP4pgal  (X  =  Met,  Asn,  Gin,  or  Arg)  (see  text).  Cells 
were  labeled  with  pS]methionine-cysteine,  followed  by  a  chase  for  0,  1,  and  2  h  (as  indicated  at  the  top)  in  the  presence  of  cycloheximide,  preparation  of  extracts, 
immunoprecipitation,  SDS-PAGE,  autoradiography,  and  quantitation,  essentially  as  described  elsewhere  (40).  The  bands  of  XmsP4pgal  proteins  and  the  DHFR-HA- 
UbR48  reference  protein  are  indicated  on  the  right  as  X-pgal  and  DHFR.  (C)  Same  as  panel  B  but  with  immortalized  Ntanl  ^  EF  cells.  (D)  Quantitation  of  the  in 
vivo  degradation  of  X-nsP4|3gal  test  proteins  using  the  reference-based  pulse-chase  patterns  in  panels  B  and  C  (see  Materials  and  Methods).  The  amounts  of  in 
an  X-nsP43gal  protein,  relative  to  in  the  DHFR-HA-Ub^"^®  reference  protein  at  the  same  time  points,  were  plotted  as  percentages  of  this  ratio  for  Met-nsP4pgal 
at  time  zero.  (Met-nsP4pgal  bore  a  stabilizing  N-terminal  residue.)  Open  and  closed  symbols,  wild-^e  and  Ntanr^~  EF  cells,  respectively;  □  and  ■,  Met-nsP4pgal; 
O  and  •,  Asn-nsP4pgaI;  V  and  T,  Arg-nsP4pgal;  A  and  ▲,  Gln-nsP4pgal. 


proteins.  The  term  ID^  is  superfluous  in  the  case  of  a  strictly 
first-order  decay,  which  is  defined  by  a  single  half-life.  How¬ 
ever,  the  in  vivo  degradation  of  most  proteins  deviates  from 
first-order  kinetics.  Specifically,  the  rate  of  degradation  of 
short-lived  proteins  can  be  much  higher  during  the  pulse,  in 
part  because  a  newly  labeled  (either  nascent  or  just  completed) 
polypeptide  is  conformationally  immature  and  consequently 
may  be  targeted  for  degradation  more  efficiently  than  its  ma¬ 
ture  counterpart.  This  enhanced  early  degradation,  previously 
termed  the  zero-point  effect  (5),  is  described  by  the  parameter 
Uy  (40).  It  was  found  that  a  large  fraction  of  the  zero-point 
effect  results  from  the  cotranslational  degradation  of  nascent 
(being  synthesized)  polypeptides,  which  never  reach  their  ma¬ 
ture  size  before  their  destruction  by  processive  proteolysis  (G. 
Turner  and  A.  Varshavsky,  unpublished  data). 

Asn-nsP4(3gal  was  metabolically  unstable  in  +/+  EF  cells: 
its  (percent  degradation  during  the  10-min  pulse)  was 
~45%,  in  comparison  to  of  the  long-lived  Met-nsP4pgal, 
taken  as  100%  (Fig.  4B  and  D).  However,  the  same  Asn- 
nsP4pgal  was  completely  (and  reproducibly)  stabilized  in  the 
Ntanl~^~  EF  cells,  its  pulse-chase  pattern  becoming  indistin¬ 
guishable  from  that  of  the  normally  long-lived  Met-nsP4Pgal. 
In  contrast,  both  Gln-nsP4pgal  and  Arg-nsP4pgal  were  com¬ 
parably  short-lived  in  both  +/+  and  Ntanl~^~  cells  ^ig.  4B  to 
D).  Taken  together  with  the  measurements  of  Nt  -amidase 


enzymatic  activity  (Fig.  4A),  these  data  indicated  that  the 
Ntanl~^~  EF  cells,  in  contrast  to  congenic  wild-type  cells, 
lacked  the  Asn-specific  branch  of  the  N-end  rule  pathway  but 
retained  the  rest  of  this  pathway. 

Motor  coordination  and  spontaneous  activity  of  Ntanl~^~ 
mice.  The  Ntanl~^~  mice  were  fertile,  apparently  healthy,  and 
of  similar  size  and  weight  as  +/+  littermates,  and  they  cared 
for  their  offspring.  These  mice  oriented  to  sound;  their  limb 
movements  and  behavior  appeared  to  be  indistinguishable 
from  those  of  congenic  +/+  mice.  Histological  examination  of 
Ntanl~’~  tissues  (small  intestine,  liver,  pancreas,  adrenal 
gland,  thyroid  gland,  kidney,  ovary,  testis,  heart,  spleen,  thy¬ 
mus,  skeletal  muscle,  brain,  and  sciatic  nerve)  did  not  detect 
abnormalities.  No  significant  differences  were  observed  be¬ 
tween  +/+  and  Ntanl~^~  thymocytes  in  the  ability  to  undergo 
apoptosis  in  response  to  either  radiation  or  dexamethasone, 
suggesting  that  Ntanl~^~  mice  were  not  impaired  in  apoptotic 
responses.  Thus  far,  the  only  nonbehavioral  differences  be¬ 
tween  +/+  and  congenic  Ntanl~^~  mice  were  a  weaker  mito¬ 
genic  response  of  Ntanl~^~  splenocytes  and  thymocytes  to 
phytohemagglutinin  and  hypersensitivity  of  Ntanl  mice  to 
bacterial  lipopolysaccharide  (data  not  shown).  In  addition, 
over  the  last  1.5  years  the  frequency  of  natural  death  among 
Ntanl~^~  mice  was  -50%  higher  than  among  congenic  +/+ 
mice. 
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FIG.  5.  Normal  motor  coordination  and  reduced  spontaneous  activity  oiNtanl~^~  mice.  Open  symbols:  +/+  mice;  solid  symbols;  congenic  Ntanl~^~  mice  (see 
Materials  and  Methods).  Statistically  significant  differences  {P  <  0,05)  are  indicated  by  *.  Experiments  used  2ANtanl~'~  mice  and  their  congenic  +/+  littermates  (A 
to  C)  or  nonlittermates  (D  to  G).  (A)  Rotarod  test.  Data  show  time  elapsed  before  the  animals  fell  from  a  horizontal  rod  rotating  at  10  rpm  (squares)  or  20  rpm  (circles). 
No  significant  differences  were  found  between  +/+  and  Ntanl~^~  mice.  (B)  Weight  retention  test.  Data  show  time  elapsed  before  the  animals  released  a  hook.  The 
height  of  each  bar  represents  the  time  averaged  from  360  trials.  There  were  no  significant  differences  between  +/+  and  Ntanl~'-  mice.  (C)  Coat  hanger  test.  Squares, 
time  elapsed  before  the  animal  grabbed  the  horizontal  bar  with  both  rear  paws;  circles,  time  elapsed  before  the  front  two  paws  reached  one  of  the  side  bars.  Each  data 
point  is  an  average  of  48  trials.  There  were  no  significant  differences  between  +/+  and  Ntanr^-  mice,  (D)  Shuttlebox  avoidance  test.  Shown  are  the  number  of  null 
responses  per  50  daily  trials  in  which  the  mouse  remained  in  the  original  compartment  of  the  shuttlebox  and  received  20  s  of  electric  shock  during  the  original  learning 
and  retention  testing  7  to  10  weeks  later.  (E)  Passive  avoidance  test,  measuring  time  before  entering  a  dark  chamber  after  an  automated  guillotine  door  opened, 
exposmg  the  dark  chamber.  On  day  1  the  mice  were  shocked  after  entering  the  dark  chamber;  13  Ntanr^~  mice  and  12  congenic  +/+  mice  were  used.  (F)  Open-field 
test.  Shown  are  the  total  distance  traveled  by  the  animal  during  the  observation  time  of  9  min  (a),  time  (out  of  9  min  in  total)  that  the  animal  spent  in  the  center  area 
(b),  and  time  (out  of  9  min  in  total)  that  the  animal  spent  within  1  cm  of  the  walls.  This  set  of  tests  used  28  Ntanl~^~  mice  and  27  congenic  -I-/+  mice. 


Motor  coordination  of  Ntanl~'~  and  congenic  +/+  mice 
was  compared  using  the  rotarod  apparatus.  No  difference  in 
the  ability  of  mice  to  stay  on  a  rotating  horizontal  rod  was 
observed  between  the  Ntanl~^~  and  +/+  strains  (Fig.  5A). 
Comparisons  of  physical  strength  (weight  retention  test),  of 
both  coordination  and  strength  (coat  hanger  test),  and  of  walk¬ 
ing  patterns  (hindpaw  footprint  test)  also  did  not  distinguish 
between  and  +/+  mice  (Fig.  5B  and  C  and  data  not 

shown).  These  and  related  observations  indicated  that 


Ntanl~^~  mice  were  not  impaired  in  either  motor  coordina¬ 
tion,  physical  strength,  or  the  learning  ability  required  for 
improved  performance  in  repeated  trials  (Fig.  5A  and  C). 

The  locomotor  activity  of  Ntanl~^~  mice  was  assayed  in  the 
open-field  test.  By  several  criteria,  Ntanl~^~  mice  exhibited 
significantly  lower  spontaneous  activity  than  the  congenic  wild- 
type  mice  (Fig.  5F).  Specifically,  Ntanl~^~  mice  traveled  sig¬ 
nificantly  less  distance  (Fig.  5Fa),  spent  significantly  more  time 
near  the  field's  center  (Fig.  5Fb),  and  consequently  spent  less 
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time  within  1  cm  from  the  walls  (Fig,  5Fc).  These  findings  will 
be  considered  in  conjunction  with  data  from  the  platform¬ 
leaving  test  (see  Fig,  6  and  Discussion), 

Active  and  passive  avoidance  learning.  The  shuttlebox 
avoidance  test  compared  shock-motivated  learning  in  +/+  and 
congenic Ntanl~^^  mice.  In  this  active  avoidance  learning  pro¬ 
cedure,  a  light-conditioned  stimulus  precedes,  by  5  s,  the  un¬ 
conditioned  stimulus  of  electric  shock,  which  remains  on  for 
20  s.  The  animal  can  (i)  cross  to  the  safe  compartment  within 
the  5-s  conditioned-stimulus  interval,  thus  avoiding  shock;  (ii) 
cross  over  during  the  20-s  unconditioned-stimulus  interval,  to 
escape  the  ongoing  shock;  or  (iii)  not  cross  over  and  receive 
the  full  20  s  of  shock  (the  latter  outcome  is  scored  as  the  null 
response).  Neither  +/+  nor  Ntanl~^~  mice  showed  any  evi¬ 
dence  of  learning  over  the  5  days  of  testing.  However,  there 
were  performance  differences  in  the  distribution  of  their  re¬ 
sponses.  The  two  groups  averaged  only  four  avoidance  re¬ 
sponses  per  day,  but  the  +/+  mice  exhibited  more  escape 
responses  and  therefore  fewer  null  responses  than  the 
Ntanl~^~  mice  (Fig.  5D).  Neither  group  showed  evidence  of 
learning  during  the  5  days  of  retention  testing,  but  the  number 
of  null  responses  by  Ntanl~^~  mice  decreased  significantly  in 
comparison  to  their  performance  during  learning  (data  not 
shown),  indicating  that  memory  consolidation  had  occurred  in 
this  group  between  the  original  testing  and  retesting  7  to  10 
weeks  later.  Strain  129  mouse  substrains  are  known  to  perform 
poorly  in  active  avoidance  tests  (7,  53). 

In  the  shuttlebox  text,  an  electric  shock  is  used  to  compel  the 
animal  to  make  an  active  response.  By  contrast,  in  passive 
learning,  the  shock  is  used  to  compel  the  animal  to  inhibit  its 
behavior.  The  mouse  was  placed  into  a  chamber.  Ten  seconds 
later,  the  chamber  was  illuminated  and  a  guillotine  door 
opened,  exposing  a  dark  chamber.  After  the  animal  moved  into 
the  dark  (the  natural  behavior),  the  guillotine  door  closed  and 
the  animal  received  0.4  mA  of  foot  shock  for  2  s.  The  time  it 
took  the  mouse  to  enter  the  dark  chamber  (until  the  cutoff 
time  of  400  s)  was  recorded.  The  mouse  was  put  back  in  its 
home  cage;  24  h  later,  the  mouse  was  placed  back  into  the 
original  chamber,  and  all  stimulus  conditions  were  repeated 
except  that  no  shock  occurred  when  the  mouse  entered  the 
dark  chamber.  The  usual  finding  is  that  the  mouse  takes  longer 
to  enter  the  dark  chamber  on  the  second  day  because  the  prior 
shock  experience  inhibits  its  behavior.  Figure  5E  shows  the 
time  before  entry  into  the  dark  chamber  (latency  time)  on  both 
days  for  the  two  genotypes.  On  the  first  day,  the  Ntanl  mice 
took  significantly  longer  to  enter  the  dark  chamber  than  the 
-1-/+  animals.  On  the  second  day,  the  entry  into  the  dark 
chamber  by  the  Ntanl'^'^  mice  was  further  delayed  (indicating 
learning),  while  the  entry  by  the  +/-1-  mice  was  delayed  even 
more,  resulting  in  similar  retention  times  for  either  genotype 
(Fig.  5E). 

The  longer  retention  time  of  the  Ntanl  ^  mice  on  the  first 
day  of  passive  learning,  as  well  as  their  lower  activity  in  the 
open-field  test,  suggested  that  +/+  and  congenic  Ntanl" 
mice  might  differ  in  their  anxiety  levels.  To  investigate  this, 
they  were  tested  in  the  elevated  plus  maze,  which  consists  of 
two  open  arms  and  two  closed  arms  radiating  from  a  center 
platform.  The  open  arms  are  more  anxiety  inducing,  in  that  the 
animals  with  higher  anxiety  tend  to  decrease  exploration  of  the 
open  arms  (42).  Although  the  Ntanl~^~  mice  tended  to  ex¬ 
plore  the  open  arms  less  frequently  than  congenic  +/+  mice, 
the  observed  difference  was  not  statistically  significant  (data 
not  shown). 

A  socially  conditioned  exploratory  phenotype  of  Ntanl  ' 
mice.  To  address  the  finding  of  lower  activity  in  the  Ntanl"’" 
mice  (Fig.  5F)  in  a  different  way,  we  devised  a  simple  test  in 


which  the  exploratory  activity  was  assessed  in  the  context  of  a 
social  interaction.  In  this  platform-leaving  test,  two  previously 
untested  mice,  Ntanl"’"  and  congenic  +/+,  are  placed,  close 
together  and  at  the  same  time,  onto  the  center  of  a  16-  by 
22-cm  platform  2.7  cm  in  height.  Thereafter  one  records  the 
following  variables:  the  time  it  takes  each  mouse  to  leave  the 
platform,  until  the  cutoff  time  of  3  min;  the  number  and  ge¬ 
notype  of  mice  leaving  the  platform  first  (in  independent  trials 
with  pairs  of  mice);  and  the  number  and  genotypes  of  mice  not 
leaving  the  platform  by  the  cutoff  time  (see  Materials  and 
Methods).  In  contrast  to  the  standard  open-field  test,  a  mouse 
in  the  platform-leaving  test  sees  and  smells  the  second  mouse 
on  the  same  platform  and  is  otherwise  influenced  by  the  sec¬ 
ond  mouse  in  the  course  of  their  peregrinations  over  the  plat¬ 
form.  Thus,  the  proclivity  of  a  mouse  to  explore  its  environ¬ 
ment  is  modulated,  in  the  platform-leaving  test,  by  interactions 
with  another  mouse. 

A  striking  result  of  this  test  was  the  observation  that  an 
Ntanl"’"  mouse  paired  with  its  congenic  +/+  littermate 
tended  to  stay  longer  on  the  platform  and  almost  never  left  it 
by  the  cutoff  time,  in  contrast  to  the  +/+  mouse  (Fig.  6A).  For 
example,  in  a  trial  with  44  pairs  of  previously  untested 
Ntanl"’"  and  +/+  mice,  only  1  Ntanl"’"  mouse  left  the  plat¬ 
form  by  the  cutoff  time,  whereas  20  +/+  mice  left  it  by  that 
time  (Fig.  6Ac).  Repeated  trials,  at  1-h  intervals,  in  the  plat- 
form-leaving  test  progressively  decreased  this  initial  difference 
between  the  paired  Ntanl"’"  and  +/+  mice  (Fig.  6A).  Note 
that  this  decrease  resulted  from  both  an  increase  in  the  explor¬ 
atory  activity  of  Ntanl"’"  mice  (in  successive  trials)  and  a 
decrease  in  the  exploratory  activity  of  their  paired  +/+  litter- 
mates  (e.g..  Fig.  6Ac).  Thus,  the  strongly  different  initial  re¬ 
sponses  of  paired  Ntanl"’"  and  +/+  mice  to  the  novel  envi¬ 
ronment  (of  which  the  other  mouse  was  a  part)  became 
attenuated  in  subsequent  trials  (Fig.  6A)  and  nearly  disap¬ 
peared  by  the  seventh  trial  (data  not  shown). 

We  then  asked  whether  the  observed  difference  between 
Ntanl"’"  and  +/+  mice  was  retained  if  the  same  test  was 
performed  with  congenic  nonlittermates.  Strikingly,  if  the  test 
of  Fig,  6A  was  carried  out  with  wild-type  and  mutant  nonlit¬ 
termates,  the  results  were  reproducibly  different  from  those 
with  littermates  (Fig,  6B).  Specifically,  in  this  case  Ntanl"’" 
mice  did  not  stay  longer  on  the  platform:  they  left  it,  on  aver¬ 
age,  more  frequently  than  +/+  mice,  a  pattern  opposite  that 
seen  with  littermates  (Fig.  6Bb;  compare  with  Fig.  6Ab).  In 
addition,  the  striking  difference  between  Ntanl"’"  and  +/+ 
littermates  in  their  frequency  of  not  leaving  the  platform  by  the 
cutoff  time  was  no  longer  seen  in  the  otherwise  identical  tests 
with  nonlittermates  (Fig.  6Ac;  compare  with  Fig.  6Bc). 

In  the  control  experiments,  multiple  pairs  of  littermates  of 
the  same  genotype  (either  both  +/+  or  both  Ntanl"’  )  were 
subjected  to  this  test.  No  statistically  significant  differences 
between  the  platform-leaving  patterns  of  the  two  mice  in  a  pair 
were  observed  (data  not  shown).  The  results  presented  in  Fig. 
6  were  consistently  reproduced  in  four  independent  sets  of 
tests,  carried  out  over  6  months  with  previously  untested  mice 
(at  least  10  pairs  of  mice  per  test).  This  socially  conditioned 
exploratory  phenotype  of  Ntanl "’"  mice  (Fig.  6)  has  not  been 
previously  described,  to  our  knowledge,  with  other  mouse 
strains. 

Spatial  learning  and  memory.  To  compare  spatial  learning 
of  +/+  and  congenic  Ntanl"’"  mice,  we  tested  them  on  the 
hidden  platform  Morris  maze  (where  a  mouse  had  to  find  a 
platform  submerged  in  water  pool),  using  the  reversal  learning 
regimen  (see  Materials  and  Methods)  and  retesting  7  or  8 
weeks  later.  Both  +/+  and  Ntanl" ’"  mice  learned  to  find  the 
platform  and  did  not  differ  in  either  the  time  taken  to  do  it  or 
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FIG.  6.  Socially  conditioned  differences  in  e^loratory  behavior  between  +/+  and  congenic  Ntanl  ’  mice.  Open  bars,  +/+  mice;  solid  bars,  congenic  Ntanl~'~' 
mice.  In  the  platform-leaving  test,  one  Ntanl  mouse  and  one  +1+  mouse  (either  a  littermate  or  a  nonlittermate)  were  placed,  at  the  same  time,  on  a  16-  by  22- 
by  2.7-cm  platform  Mice  were  allowed  to  either  explore  or  step  down  from  the  platform  until  the  cutoff  time  of  180  s.  Bars  show  the  time  by  which  each  animal  left 
the  platform  (a),  the  number  of  animals,  of  each  genotype,  leaving  first  (b),  and  the  number  of  animals,  of  each  genotype,  not  leaving  by  the  cutoff  time  (c).  Four 
independent  sets  of  tests  (at  least  10  pairs  of  mice  per  test)  were  carried  out  over  6  months,  using  44  pairs  of  -F/+  and  Ntanl~^~  littermates,  which  were  produced 
through  heterozygous  matings  and  identified  by  genotyping  -700  mice.  The  results  differed  by  less  than  15%  among  the  independent  tests.  (B)  Same  as  panel  A  except 
with  pairs  of  nonlittermates  (24  -h/-h  mice  and  24  Ntanr‘-  mice).  ^  v  /  y 


the  distance  traveled;  they  also  did  not  differ  on  these  mea¬ 
sures  in  the  subsequent  reversal  learning  test,  when  the  plat¬ 
form  was  moved  to  the  diagonally  opposite  quadrant  (data  not 
shown).  However,  7  to  8  weeks  later,  on  the  first  day  of  reten¬ 
tion  testing,  the  Ntanl  mice  were  inferior  to  congenic  +/+ 
mice,  in  that  they  took  significantly  longer  time  to  find  the 
platform  and  traveled  a  significantly  longer  distance  before 
reaching  it  (data  not  shown),  suggesting  that  the  Ntanl~^~ 
mice  have  a  less  effective  spatial  memory  system. 

Another  measure  of  spatial  learning  employs  the  water  ver¬ 
sion  of  the  radial  arm  maze,  where  four  of  the  eight  arms 
contain  escape  platforms  (30).  There  are  no  intramaze  cues 
available  to  the  mouse,  and  it  must  use  extramaze  spatial 
information  involving  reference  and  working  memory  to  re¬ 
member  which  alleys  to  enter  and  which  to  avoid.  The  mice 
were  given  training  for  1  day  and  tested  for  11  consecutive 
days,  receiving  four  trials  each  day.  The  measure  used  was  the 
total  number  of  errors.  Ntanl  mice  made  significantly  more 
errors  in  the  acquisition  phase  (sessions  2  to  7)  than  the  con¬ 
genic  +/-H  mice  (Fig.  7A).  In  the  asymptotic  phase  (sessions  8 
to  12),  there  were  no  significant  differences  between  the  two 
strains  (Fig.  7A), 

The  four  trials  within  each  daily  session  impose  an  increas¬ 
ing  demand  on  memory.  Thus,  on  the  first  trial,  there  are 
platforms  in  four  of  the  eight  alleys,  and  entering  any  of  these 
terminates  the  trial.  On  the  second  trial,  three  of  the  alleys  still 
contain  platforms;  the  mouse  has  to  remember  which  alley  it 
had  entered  on  the  prior  trial  and  not  enter  that  alley  again. 
Finally,  on  the  fourth  trial,  the  mouse  has  to  remember  which 
three  alleys  it  had  chosen  earlier,  so  that  it  can  find  the  one 
remaining  alley  containing  a  platform.  Thus,  by  studying  the 


error  rate  from  trial  to  trial,  summed  over  the  sessions,  one  can 
compare  the  abilities  of  mice  to  cope  with  an  increasing  mem¬ 
ory  load.  Figure  7C  shows  the  error  curves  summed  over  the 
acquisition  phase.  As  expected,  errors  increased  as  the  memory 
load  increased.  However,  the  rate  of  increase  was  significantly 
greater  for  the  Ntanl~'~  mice,  especially  on  trial  4  (Fig.  7C). 

Unlike  previously  tested  mouse  strains  (30),  both  +/+  and 
congenic  Ntanl  mice  exhibited  poor  learning  over  the  11 
days  of  testing,  consistent  with  inferior  performance  of  strain 
129-derived  mice  in  several  other  learning  regimens  (7,  53). 
Thus,  the  significant  effects  described  above  (Fig.  7A  and  C) 
represent  largely  differences  in  performance  but  indicate  little 
about  the  spatial  learning  ability  of  these  mice.  To  determine 
whether  the  observed  effects  were  related  to  spatial  compe¬ 
tence,  the  identical  experiment  was  carried  out  using  the  same 
radial  arm  maze,  except  that  a  nonspatial  version  was  devised, 
by  having  different  visual  patterns  in  each  alley  and  rotating  the 
maze  between  trials  so  that  the  extramaze  cues  were  rendered 
meaningless.  In  four  of  the  arms,  escape  platforms  were 
placed,  and  the  mice  had  to  learn  to  associate  a  specific  visual 
configuration  (e.g,,  black  dots  on  a  white  background,  or  hor¬ 
izontal  black  and  white  stripes)  with  the  presence  or  absence  of 
a  platform  (29).  In  contrast  to  the  results  with  the  spatial  radial 
arm  maze,  no  significant  differences  between  the  two  geno¬ 
types  were  observed  in  the  nonspatial  radial  arm  maze,  either 
in  regard  to  the  overall  number  of  errors  (Fig.  7B)  or  in  regard 
to  the  error  rate  as  a  function  of  memory  load  (data  not 
shown).  Here,  too,  both  +/+  and  congtmc  Ntanl~^~  mice  did 
not  exhibit  significant  learning  over  the  11  days  of  testing. 
Thus,  no  conclusions  can  be  drawn  about  spatial  or  nonspatial 
learning.  At  the  same  time,  it  was  possible  to  conclude  that 
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FIG  7  Spatial  learning  and  memory  in  Ntanl~‘~  mice.  Open  symbols  and  bars  represent  +/  +  mice;  solid  symbols  and  bars  represent  congemc  Ntanl  mice.^e 
the  text  for  details.  (A  to  C)  Results  for  10  +/+  mice  and  10  Ntanr^-  mice.  (A)  Mean  daily  errors  made  by  mice  for  each  phase  m  the  spatial  radial  arm  maze,  '^e 
data  were  separated  into  those  for  acquisition  phase  (sessions  2  to  7)  and  asymptotic  phase  (sessions  8  to  12).  (B)  Same  as  panel  A  except  that  the  mice  test^ 
in  the  nonspatial  radial  arm  maze,  in  which  the  spatial  cues  were  replaced  by  nonspatial  cues.  (Q  Effects  of  memory  load  in  the  water  version  of  the  spatial  ra^l  arm 
maze.  Shown  are  total  errors  per  trial.  (D)  Performance  of  29  +/+  mice  and  30  congenic  Ntanl  ^  mice  in  the  Lashley  maze.  Shown  are  the  ratio  of  the  number  of 
correct  path  segments  taken  to  the  total  number  of  segments  (learning  index)  (a),  errors  in  T-choices  (b),  and  errors  made  by  moving  away  from  the  goal  (c). 


Ntanl" mice  were  less  effective  in  dealing  with  spatial  infor¬ 
mation  than  the  congenic  +/+  mice. 

Another  test  of  spatial  learning  and  memory  was  the  Lashley 
III  maze.  This  maze  contains  (i)  cul-de-sacs  that  an  animal  has 
to  learn  to  avoid  and  (ii)  T-choices.  Both  +/+  and  Ntanl  ' 
mice  were  able  to  learn  the  maze,  but  Ntanl~‘"  mice  were 
significantly  less  competent,  as  measured  by  the  learning  index, 
T-forward  errors  (making  a  wrong  T-choice),  and  the  total 
number  of  backward  errors  (Fig.  7D).  When  retested  7  or  8 
weeks  later,  Ntanl" mice  had  lower  scores  on  the  first  trial 
on  all  three  measures,  showing  inferior  long-term  retention 
(data  not  shown).  These  results  were  similar  to  the  findings 
with  the  Morris  maze  (see  above),  except  that  there  was  a 
significant  difference  in  the  original  learning  of  the  Lashley 
maze  favoring  the  +/+  mice  (Fig.  7D).  This  difference  may 
stem  from  the  fact  that  the  Lashley  maze  can  be  learned  using 
intramaze  cues  to  memorize  the  pathway  of  left  and  right 
turns,  in  addition  to  the  use  of  extramaze  cues,  which  are 
present  in  both  the  Lashley  and  Morris  maze  tests.  The  results 
of  the  two  radial  arm  studies  (Fig.  7A  to  C)  indicated  that  the 
Ntanl"^"  mice  used  spatial  information  less  effectively  than 
the  +/+  mice.  In  summary,  the  results  of  spatial  memory  tests 
(Fig.  7)  strongly  suggested  that  mouse  Nt^-amidase  contrib¬ 
utes  to  the  processing  of  spatial  information  and  to  long-term 
retention  of  spatial  learning. 


DISCUSSION 

Homologs  of  the  310-residue  mammalian  Nt^-amidase,  en¬ 
coded  by  the  Ntanl  gene  (24,  59),  are  present  in  other  verte¬ 
brates,  in  arthropods  such  as  D.  melanogaster,  and  in  plants 
such  as  A.  thaliana  but  are  apparently  absent  from  the  nema¬ 
tode  Caenorhabditis  elegans  (data  not  shown).  The  deamida¬ 
tion  of  N-terminal  (and  apparently  only  N-terminal)  Asn  res¬ 
idue  in  proteins  or  short  peptides  is  the  only  known  enzymatic 
activity  of  Nt^-amidase.  This  enzyme  was  operationally  de¬ 
fined  as  a  component  of  the  N-end  rule  pathway,  because  it 
converts,  in  vivo,  the  N-terminal  Asn  of  engineered  N-end  rule 
substrates  into  N-terminal  Asp,  which  is  then  conjugated  by 
R-transferase  to  Arg,  a  primary  destabilizing  residue  (34,  63). 
Physiological  substrates  of  Nt^-amidase  remain  to  be  identi¬ 
fied.  Physiological  substrates  are  also  unknown  for  the  fungal 
Nt-amidase,  which  can  deamidate  either  N-terminal  Asn  or 
N-terminal  Gin  and  lacks  significant  sequence  similarities  to 
metazoan  Nt^-amidase  (6).  In  the  present  work,  mouse  Nt^- 
amidase  was  studied  using  several  approaches,  including  tar¬ 
geted  mutagenesis.  We  report  the  following  results. 

(i)  Both  copies  of  the  mouse  Ntanl  gene,  which  encodes 
Nt-amidase,  were  replaced  by  a  deletion/disruption  allele. 
The  resulting  Ntanl"'"  mice  were  viable  and  fertile.  Their 
behavioral  phenotypes  are  described  below. 

(ii)  EF  cells  horn  Ntanl" mice  lacked  Nt^-amidase  activ- 
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ity,  in  contrast  to  congenic  +/+  EF  cells.  In  addition,  among 
the  normally  short-lived  substrates  of  the  N-end  rule  pathway, 
only  those  bearing  N-terminal  Asn,  a  tertiary  destabilizing  res¬ 
idue  (see  the  introduction),  became  long-lived  in  the Ntanl~^~ 
EF  cells.  Thus,  at  least  these  cells,  and  by  inference  the 
Ntanl  ^  mice,  lacked  the  asparagine  branch  of  the  N-end  rule 
pathway. 

^ii)  The  Ntanl~’~  EF  cells  contained  wild-type  levels  of 
Nt^-amidase,  which  mediates  the  activity  of  N-terminal  Gin, 
another  tertiary  destabilizing  residue  in  the  N-end  rule.  Thus, 
Nt  -amidase  and  Nt^-amidase  (the  latter  enzyme  remains  to 
be  isolated  and  characterized)  are  encoded  by  different  genes. 

(iv)  The  brains  and  testes  of  +/+  mice  contained,  respec¬ 
tively,  the  1.4-  and  1,6-kb  Ntanl  mRNAs.  In  addition,  the 
testes  but  not  the  brains  of  +/+  mice  also  contained  the  Ntanl- 
derived  RNAs  of  1.1  and  4  kb.  The  brains  and  testes  of 
Ntanl  ^  mice  lacked,  respectively,  the  1.4-  and  1.6-kb  Ntanl 
mRNAs.  However,  the  testes  of  Ntanr‘~^  mice  retained  the 
testis-specific  1.1-  and  4-kb  Mani -derived  RNAs,  which  were 
found  to  lack  the  5'-half  of  the  Ntanl  ORF  (including  the 
region  deleted  mNtanr^~  mice)  and  to  encompass  the  3'  half 
of  Ntanl  ORF.  Whether  these  testis-specific  transcripts  are 
physiologically  relevant  in  +/+  mice  remains  to  be  deter¬ 
mined. 

(v)  Deletion-disruption  of  Ntanl  in  the  Ntanr'~  mice  did 
not  significantly  change  the  levels  of  mRNAs  encoding  other 
components  of  the  N-end  rule  pathway,  such  as  ATElp  (R- 
transferase),  UBRlp  (E3a,  the  pathway’s  E3  component), 
UBR2p  (a  homolog  of  UBRlp),  and  mHR6Bp  (E2i4k). 

(vi)  Ntanl,  which  is  expressed,  at  different  levels,  in  most  if 
not  all  tissues  of  adult  mice  (24),  was  found  through  in  situ 
hybridization  to  be  particularly  strongly  expressed  in  the 
branchial  arches,  the  tail  bud,  and  the  limb  buds  of  ElO  em¬ 
bryos.  This  expression  pattern  was  indistinguishable  from  that 
of  l^rl  (36),  consistent  with  the  assignment  of  NTANlp  (Nt^- 
amidase)  and  UBRlp  (E3a)  to  the  same  pathway. 

(vii)  An  NTANlp-GFP  fusion  was  enriched  in  the  nucleus  of 
transfected  mouse  3T3  cells,  in  contrast  to  free  GFP,  which  was 
uniformly  distributed.  NTANlp-GFP  was  also  present  in  the 
nucleus-proximal  cytoplasm  but  was  apparently  much  less 
abundant  at  the  cells’  periphery.  This  localization  pattern  of 
the  mammalian  Nt^-amidase  is  quite  different  from  that  of  the 
5.  cerevisiae  NZ47-encoded  Nt-amidase,  most  of  which  is  lo¬ 
cated  in  the  yeast  mitochondria  (H,  R.  Wang  and  A.  Var¬ 
shavsky,  unpublished  data).  In  contrast  to  the  mammalian  Nt^- 
amidase,  the  yeast  Nt-amidase,  which  has  no  significant 
sequence  similarities  to  the  metazoan  enzyme,  can  deamidate 
either  N-terminal  Asn  or  N-terminal  Gin  (6). 

(viii)  While  the  abnormal  phenotypes  oiNtanl~^~  mice  are 
apparently  not  confined  to  behavioral/cognitive  alterations 
(see  Results),  it  is  the  latter  that  have  been  analyzed  in  some 
detail.  Briefly,  the  Ntanl~^~  mice  were  indistinguishable  from 
congenic  +/+  mice  in  motor  coordination  and  general  physical 
performance  but  had  reduced  spontaneous  activity  and  less 
effective  spatial  memory.  Remarkably,  the  exploratory  behav¬ 
ior  of  Ntanl  ^  mice  was  found  to  be  particularly  different 
from  that  of  congenic  +/+  mice  in  the  presence  of  social 
interactions.  If  a  previously  untested  Ntanl~^~  mouse  was 
placed  on  a  small,  slightly  elevated  platform  with  its  +/+ 
littermate  (which  it  grew  up  with  in  the  same  cage),  the 
Ntanl  mouse  left  the  platform  much  more  slowly  than  the 
+/+  mouse.  In  contrast,  when  an  Ntanl~^~  mouse  was  sub¬ 
jected  to  this  test  in  the  presence  of  a  congenic  nonlittermate 
+/+  mouse,  it  tended  to  leave  the  platform  faster  than  the 
+/+  mouse,  even  though  mice  exhibited  diminished 

exploratory  activity  when  tested  alone  (Fig.  6). 


Our  working  h)^othesis  is  that  Ntanr'~  mice  are  socially 
recessive,  in  relation  to  congenic  +/+  mice,  and  in  addition 
have  a  lower  spontaneous  exploratory  activity.  In  a  relatively 
unstress^l  setting,  such  as  when  it  is  placed  on  the  platform 
side-by-sjde  with  a  (familiar)  littermate,  the  behavior  of 
Ntanl  '  mice  is  governed  largely  by  their  inherently  lower 
exploratory  activity,  yielding  the  observation  that  these  mice 
leave  the  platform  much  more  slowly  than  their  +/-h  litter- 
mates  (Fig.  6A).  By  contrast,  in  an  otherwise  identical  test  with 
^  "*"/F  nonlittermate,  the  postulated  social  recessiveness  of 
Ntanl  ^  mice  becomes  their  behavior-governing  trait.  As  a 
result,  they  leave  the  platform  faster  than  the  +/+  nonlitter- 
mates,  to  reduce  the  proximity-induced  anxiety  (Fig.  6B). 

The  socially  conditioned  exploratory  phenotj^e  oiNtanl~^~ 
mice,  identified  through  the  platform-leaving  test  (Fig.  6),  has 
not  been  previously  described,  to  our  knowledge,  with  other 
mouse  strains.  It  is  difficult  to  compare  this  finding  with  be¬ 
havioral  studies  of  other  mouse  mutants,  because  most  of  the 
earlier  analyses  of  e^loratory  behavior  used  solitary  mice.  An 
example  of  differential  effect  of  stress  on  a  specific  mutant  is 
provided  by  the  analysis  of  mice  lacking  the  serotonin  receptor 
lA.  These  mice  exhibited  decreased  exploratory  activity  and 
increased  fear  of  aversive  environments  (in  comparison  to  +/+ 
mice),  as  measured  by  the  open-field  test  and  the  elevated 
plus-inaze  test,  respectively.  However,  the  mutant  mice  were 
indistinguishable  from  +/+  littermates  in  a  less  stressful  set¬ 
ting  such  as  their  home  cages  (50). 

A  parsimonious  molecular  interpretation  of  our  results  (Fig. 
5  to  7)  is  that  a  normally  short-lived  regulatory  protein(s)  that 
is  targeted  for  degradation  by  the  N-end  rule  pathway  through 
the  protein’s  N-terminal  Asn  becomes  long-lived  in  Ntanl 
mice.  The  resulting  increase  in  the  steady-state  concentration 
of  this  protein(s)  alters  the  functioning  of  relevant  neural  net¬ 
works  in  the  adult  brain  or,  nonalternatively,  changes  these 
networks  in  the  course  of  their  establishment  during  develop¬ 
ment.  That  the  inactivation  of  a  Ub-dependent  proteolytic 
pathway  could  affect  cognitive  functions  is  illustrated,  for  ex¬ 
ample,  by  the  identification  of  UBE3A  as  a  human  gene  en¬ 
coding  an  E3  protein  of  the  Ub  system  called  E6AP.  This  gene 
is  mutated  in  the  Angelman’s  syndrome,  the  symptoms  of 
which  include  motor  dysfunction  and  mental  retardation  (33, 
45).  Mice  lacking  UBE3A  exhibit  deficits  in  contextual  learning 
and  long-term  potentiation  (31). 

The  absence  of  severe  impairments  in  the  Ntanl~'~  mice 
should  make  them  particularly  suitable  as  vehicles  for  the  ap¬ 
proaches  to  conditional  mutagenesis  that  use  conditional  de- 
stabilization  of  a  protein  of  interest.  One  version  of  this  ap¬ 
proach  could  use  a  transgenic  mouse  strain  that  lacks 
endogenous  Ntanl  and  expresses  this  gene  from  one  of  the 
previously  constructed  promoters  whose  activity  can  be  con¬ 
trolled  by  small  molecules  such  as  tetracycline  or  ecdysone  (9), 
If  a  protein  of  interest  is  modified,  through  knock-in  mutagen¬ 
esis  and  the  Ub  fusion  technique  (63),  to  bear  an  Asn-contain- 
ing  N-degron,  the  resulting  protein  would  be  long-lived  in  the 
absence  of  the  Mani -encoded  Nt^-amidase  but  short-lived, 
and  therefore  scarce,  in  the  presence  of  Nt^-amidase.  In  this 
method,  the  steady-state  level,  and  hence  the  activity,  of  a 
protein  of  interest  is  controlled  through  -dependent,  reg¬ 
ulated  changes  of  the  protein’s  metabolic  stability.  One  advan¬ 
tage  of  this  strategy  is  that  it  does  not  require  alterations  of  a 
promoter  that  expresses  a  gene  of  interest,  thereby  avoiding 
potential  perturbations  of  the  promoter  regulation  during  de¬ 
velopment  and  differentiation.  This  approach  may  be  com¬ 
bined  with  the  existing  technologies  for  conditional  mutagen¬ 
esis  of  the  mouse  (13,  55),  enhancing  them  through  regulated 
degradation  of  a  protein  of  interest. 
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Inasmuch  as  behavior  is  an  emergent  property  of  highly 
complex  neural  networks  interacting  with  the  musculoskeletal 
apparatus,  a  behavioral  alteration  that  is  caused  by  the  absence 
of  a  specific  protein  from  the  brain  is  difficult  to  understand  in 
terms  of  higher-order  neural  events  even  in  the  case  of  a 
protein  (e.g.,  a  neurotransmitter-gated  ion  channel)  whose  mo¬ 
lecular  function  in  the  individual  neurons  is  clear.  The  difficulty 
is  further  increased  in  the  case  of  a  protein  such  as  Nt^- 
amidase,  whose  physiological  substrates  remain  to  be  identi¬ 
fied.  At  the  same  time,  the  absence  of  a  priori  reasons  to 
suspect  a  specific  role  for  Nt^-amidase  in  the  brain’s  functions 
and  the  robustness  of  behavioral  differences  between  the 
Ntanl^^~  and  congenic  +/+  mice  (Fig.  5  to  7)  make  these 
findings  particularly  intriguing.  Further  advances  in  this  in¬ 
quiry  will  require,  at  minimum,  the  identification  of  physiolog¬ 
ical  substrates  of  Nt^-amidase.  The  availability  of  Ntanl~^~ 
mice  and  cell  lines  derived  from  them  should  facilitate  the 
discovery  of  these  substrates. 
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When  The  Lasker  Foundation  first  approached  Nature  Medicine  with  the  idea  of  working  together  to  present  a  series 
of  articles  from  the  winners  of  their  prestigious  awards,  we  were  immediately  enthusiastic.  The  prospect  of  working 
with  such  celebrated  figures  from  the  biomedical  research  community  is  one  that  would  fill  any  scientific  editor 
with  glee.  This  year,  as  before,  we  have  not  been  disappointed.  The  award  winners  have  again  shown  themselves  not 
only  to  be  exceptional  figures  in  the  community  but  also  accomplished  authors  with  great  stories  to  tell.  We  thank 
them  for  their  generous  contributions  to  these  pages. 

Once  again  the  awards  acknowledge  very  special  scientists  whose  contributions  have  paved  new  paths  of  research, 
significantly  advancing  our  understanding  of  diseases  and  our  quest  for  prevention  and  treatment.  (See  Joseph 
Goldstein^s  Foreword  for  a  synopsis  of  their  remarkable  contributions  to  the  community.)  We  hope  their  articles 
will  inspire  scientists  and  non-scientist  alike  to  redouble  their  commitment  to  biomedical  research  and  to  the  ulti¬ 
mate  goal  of  eradicating  disease. 

Beatrice  Renault 


Laskers  for  2000:  Ubiquitin  Pathway,  Hepatitis  C,  and  Sydney  Brenner 

The  Lasker  Award  in  Basic  Medical  Research  is  given  each  year  The  first  evidence  that  this  pathway  had  a  regulatory  func- 
to  honor  a  fundamental  discovery  that  opens  up  a  new  area  of  tion  in  living  cells  was  provided  in  1984  by  Varshavsky,  who 
biomedical  science.  This  year's  recipients— Aaron  Ciechanover,  traced  the  cell  cycle  arrest  phenotype  in  a  mouse  cell  line  to  a 
Avram  Hershko  and  Alexander  Varshavsky— discovered  and  cessation  of  the  degradation  of  rapidly  turning-over  proteins, 
recognized  the  broad  significance  of  the  ubiquitin  system  of  In  collaboration  with  Ciechanover,  Varshavsky  and  Daniel 
regulated  protein  degradation.  Ciechanover  and  Hershko  are  Finley  showed  that  this  phenotype  was  caused  by  a  tempera- 
biochemists  at  the  Technion-Israel  Institute  of  Technology  in  ture-sensitive  mutation  in  the  ubiquitin-activating  enzyme 
Haifa.  Varshavsky  is  a  molecular  biologist  at  the  California  (El),  genetically  ^cementing'  the  biochemical  model  of 
Institute  of  Technology  in  Pasadena.  Hershko  and  Ciechanover.  Varshavsky  went  on  to  establish  a 

The  discovery  of  the  ubiquitin  system  is  a  'triple-A'  story  be-  genetic  system  in  yeast  that  showed  the  first  set  of  rules  that 
fitting  the  three  protagonists.  When  they  began  their  work  in  specify  which  proteins  undergo  ubiquitination. 

1977-1978,  the  genetic  code  had  been  deciphered,  the  mecha-  The  physiological  importance  of  the  ubiquitin  system  has 
nism  of  protein  synthesis  had  been  delineated  and  the  tech-  now  been  established  for  many  cellular  events  involving  the 
nologies  for  cloning  and  sequencing  genes  had  been  discovered,  cell  cycle,  antigen  processing,  inflammation,  gene  transcrip- 
But  almost  nothing  was  known  about  the  mechanism  of  intra-  tion  and  oncogenesis.  Its  discovery  is  reminiscent  of  the  dis- 
cellular  protein  degradation.  Then  in  1978,  Hershko  and  covery  of  protein  phosphorylation.  When  Fischer  and  Krebs 
Ciechanover,  working  with  reticulocyte  extracts,  discovered  first  reported  protein  phosphorylation  in  the  mid-1950s,  it 
that  proteins  could  be  targeted  for  degradation  by  the  covalent  was  thought  to  be  limited  to  one  or  two  enzymes  in  glycogen 
attachment  of  a  76-amino-acid  protein,  later  identified  as  ubiq-  metabolism.  Later  work  demonstrated  its  universality,  and 
uitin.  In  a  series  of  elegant  biochemical  papers  over  the  next  5  today  we  recognize  that  phosphorylation  regulates  nearly  all 
years,  they  showed  that  this  unprecedented  type  of  protein  at-  metabolic  pathways.  The  same  seems  to  be  true  for  the  ubiq- 
tachment  required  the  action  of  three  enzymes  that  sequen-  uitin-mediated  proteolytic  pathway. 

tially  activate  (El),  conjugate  (E2)  and  ligate  (E3)  ubiquitin  to  The  Lasker  Clinical  Medical  Research  Award  is  given  each 
protein  substrates.  They  also  discovered  that  ubiquitin-protein  year  to  honor  a  scientific  contribution  that  has  improved  the 
conjugates  are  degraded  by  an  ATP-dependent  protease,  now  treatment  of  patients  in  alleviating  or  eliminating  a  major 
known  to  be  the  multi-subunit  26S  proteasome.  medical  disease.  This  year's  recipients— Harvey  Alter  of  the 
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National  Institutes  of  Health  and  Michael  Houghton,  leader 
of  a  scientific  team  from  Chiron  Corp. — did  pioneering  work 
that  led  to  the  discovery  of  the  hepatitis  C  virus  and  the  de¬ 
velopment  of  screening  methods  that  reduced  the  risk  of 
blood  transfusion-associated  hepatitis  in  the  US  from  30%  in 
1970  to  nearly  zero  in  2000. 

Harvey  Alter  has  studied  the  biology  of  transfusion  hepatitis 
for  40  years,  during  which  time  he  acquired  a  priceless  collec¬ 
tion  of  'pedigreed'  blood  samples — blood  collected  from  the 
donor  of  a  transfusion  and  from  the  donor's  recipient.  In  the 
mid-1970s  he  and  his  collaborators  at  the  National  Institutes  of 
Health,  Robert  Purcell  and  Stephen  Feinstone,  used  these  'pedi¬ 
greed'  blood  samples  to  show  that  transfusion  hepatitis  was  not 
caused  by  either  the  hepatitis  A  virus  or  the  hepatitis  B  virus, 
but  by  some  unidentified  virus  (or  viruses),  which  they  called 
non-A  non-B.  In  studying  the  natural  history  of  non-A  non-B 
hepatitis.  Alter  showed  that  20%  of  infected  people  went  on  to 
develop  cirrhosis,  a  serious  liver  disease.  In  an  attempt  to  iso¬ 
late  the  putative  virus.  Alter  and  Purcell  injected  chimpanzees 
with  samples  from  patients  with  non-A  non-B  hepatitis  and 
demonstrated  that  the  injected  animals  developed  hepatitis. 

Despite  the  early  success  in  transmitting  the  non-A  non-B 
virus  to  animals,  a  conclusive  identification  proved  elusive 
until  1985.  Here  the  story  shifts  to  Michael  Houghton's  labora¬ 
tory  at  the  Chiron  Corp.,  where  Houghton  and  his  colleagues 
Qui-Lim  Choo  and  George  Kuo  embarked  on  a  risky  strategy 
aimed  at  molecularly  cloning  the  genome  of  the  putative 
virus.  Starting  with  infected  chimpanzee  tissue  provided  by 
Daniel  Bradley  of  the  Centers  for  Disease  Control,  the  Chiron 
team  essentially  did  a  'blind  cloning'.  They  made  cDNA  li¬ 
braries  from  the  chimpanzee  tissues,  introduced  them  into 
bacteria  and  screened  for  expressed  proteins  that  bound  hypo¬ 
thetical  antibodies  from  patients  infected  with  non-A  non-B 
hepatitis  but  not  from  uninfected  individuals.  After  2  years  of 
screening  tens  of  millions  of  bacterial  clones,  they  eventually 


identified  one  clone  that  encoded  a  fragment  of  the  genome  of 
an  RNA  virus  belonging  to  the  Flaviviradae  family,  now 
known  to  be  the  hepatitis  C  virus.  Not  only  did  Houghton, 
Choo  and  Kuo  succeed  in  identifying  the  hepatitis  C  virus,  but 
they  also  demonstrated  the  power  of  'blind  cloning'  as  a 
method  to  identify  an  infectious  agent  that  had  never  been 
grown  in  the  laboratory,  never  been  visualized  microscopically 
and  never  been  identified  serologically.  If  ever  there  were  a 
technical  tour  de  force  of  molecular  cloning,  this  was  it! 

The  piece  de  resistance  of  the  story  came  in  1989  when  the 
Chiron  team  and  Alter  collaborated,  using  Chiron's  new  sero¬ 
logical  reagents  and  Alter's  'pedigreed'  blood  samples,  to 
demonstrate  that  the  new  hepatitis  C  virus  was  indeed  the 
cause  of  most  cases  of  non-A  non-B  hepatitis.  Since  1990,  reli¬ 
able  tests  for  screening  the  blood  supply  for  hepatitis  C  have 
been  in  routine  use  throughout  the  world. 

The  Lasker  Special  Achievement  Award  in  Medical  Science 
is  a  new  award  that  was  inaugurated  in  1994  and  is  given  peri¬ 
odically  to  honor  a  scientist  whose  lifetime  contributions  to 
biomedical  research  are  universally  admired  and  respected  for 
their  creativity,  importance  and  impact.  Previous  recipients 
have  included  Maclyn  McCarty  (1994),  Paul  C.  Zamecnik 
(1996),  Victor  A.  McKusick  (1997),  Daniel  E.  Koshland,  Jr. 
(1998)  and  Seymour  Kety  (1999). 

The  recipient  of  this  year's  award  is  Sydney  Brenner.  Brenner 
is  cited  for  50  years  of  brilliant  creativity  in  biomedical  science, 
exemplified  by  his  legendary  work  on  the  genetic  code;  his  dar¬ 
ing  introduction  of  the  roundworm  Caenorhabditis  elegans  as  a 
system  for  tracing  the  birth  and  death  of  every  cell  in  a  living 
animal;  his  rational  voice  in  the  debate  on  recombinant  DNA; 
and  his  trenchant  wit.  Additional  information  on  Brenner  is 
available  in  Remarks  from  the  2000  Lasker  Awards  Ceremony 
(http://www.laskerfoundation.eom/library/2000/remarks3.html). 

Joseph  L.  Goldstein 
Chair,  Lasker  Awards  fury 
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The  ubiquitin  system 

Avram  Hershko,  Aaron  Ciechanover  &  Alexander  Varshavsky 


Ubiquitin-mediated  protein  degradation:  The  early  days 

It  has  been  often  stated  that  until  re-  vinced,  however,  that  lysosomal  au- 

cently  the  ubiquitin  system  was  thought  AvRAM  HeRSHKO  tophagy  cannot  account  for  the  selectiv- 


to  be  mainly  a  'garbage  disposal'  for  the 
removal  of  abnormal  or  damaged  proteins.  This  statement  is 
certainly  not  true  for  those  who  have  been  interested  in  the  se¬ 
lective  and  regulated  degradation  of  proteins  in  cells.  The  dy¬ 
namic  turnover  of  cellular  proteins  was  discovered  in  the 
pioneering  studies  of  Rudolf  Schoenheimer  in  the  1930s,  when 
he  first  used  isotopically  labeled  compounds  for  biological 
studies*.  Between  1960  and  19 70  it  became  evident  that  protein 
degradation  in  animal  cells  is  highly  selective,  and  is  important 
in  the  control  of  specific  enzyme  concentrations^  The  molecu¬ 
lar  mechanisms  responsible  for  this  process,  however,  re¬ 
mained  unknown.  Some  imaginative  models  have  been 
proposed  to  account  for  the  selectivity  of  protein  degradation, 
such  as  one  suggesting  that  all  cellular  proteins  are  rapidly  en¬ 
gulfed  into  the  lysosome,  but  only  short-lived  proteins  are  de¬ 
graded  in  the  lysosome,  whereas  long-lived  proteins  escape 
back  to  the  cytosoP. 

I  became  interested  in  the  mechanisms  of  intracellular  pro¬ 
tein  breakdown  when  I  was  a  post-doctoral  fellow  in  the  labo¬ 
ratory  of  Gordon  Tomkins  30  years  ago  (1969-1971).  At  that 
time,  the  main  subject  in  that  laboratory  was  the  mechanism 
by  which  corticosteroid  hormones  cause  the  increased  synthe¬ 
sis  of  the  enzyme  tyrosine  aminotransferase.  I  found  this  sub¬ 
ject  a  bit  crowded,  so  I  chose  to  study  a  different  process  that 
also  regulates  tyrosine  aminotransferase  concentration:  the 
degradation  of  this  enzyme.  I  found  that  the  degradation  of  ty¬ 
rosine  aminotransferase  in  cultured  hepatoma  cells  is  com¬ 
pletely  arrested  by  inhibitors  of  cellular  energy  production, 
such  as  fluoride  or  azide'*.  These  results  confirmed  and  ex¬ 
tended  the  previous  observations  of  Simpson  on  the  energy  de¬ 
pendence  of  the  release  of  amino  acids  from  liver  slices\ 
Similar  energy  requirements  for  the  degradation  of  many  other 
cellular  proteins  were  subsequently  found  in  a  variety  of  exper¬ 
imental  systems^. 

I  was  very  impressed  by  the  energy  dependence  of  intracellu¬ 
lar  protein  breakdown,  because  proteolysis  itself  is  an  exer- 
gonic  process  that  does  not  require  energy.  I  assumed  that 
there  was  an  as-yet-unknown  proteolytic  system  that  uses  en¬ 
ergy  for  the  highly  selective  degradation  of  proteins.  After  re¬ 
turning  to  Israel  in  1971  and  setting  up  my  laboratory  at  the 
Technion,  my  main  goal  was  to  identify  the  energy-dependent 
system  responsible  for  the  degradation  of  cellular  proteins.  It 
took  a  bit  of  faith  to  base  my  entire  research  project  on  the  ef¬ 
fects  of  energy  'poisons',  because  these  inhibitors  could  affect 
protein  breakdown  rather  indirectly.  For  example,  I  remember 
that  when  Racker,  a  great  biochemist,  visited  my  laboratory  in 
Haifa  in  the  mid-1970s,  he  dismissed  these  observations  as 
being  secondary  to  the  inhibition  of  the  proton  pump,  which 
maintains  the  acidic  environment  in  lysosomes.  I  was  con- 


ity  and  regulation  of  intracellular  protein 
breakdown.  I  was  also  convinced  that  the  best  way  to  identify  a 
new  system  was  that  of  classical  biochemistry:  to  reproduce 
ATP-dependent  protein  breakdown  in  a  cell-free  system  and 
then  to  fractionate  such  a  system  and  to  find  the  mode  of  ac¬ 
tion  of  its  components. 

An  ATP-dependent  proteolytic  system  from  reticulocytes  was 
first  described  by  Etlinger  and  Goldberg^  and  then  was  ana¬ 
lyzed  by  our  biochemical  fractionation-reconstitution  studies. 
In  this  work,  I  was  greatly  helped  by  Aaron  Ciechanover,  who 
was  then  my  graduate  student.  Substantial  support  and  advice 
were  provided  by  Irwin  Rose,  who  hosted  me  in  his  laboratory 
in  Fox  Chase  Cancer  Center  for  a  sabbatical  year  in  1977-1978 
and  many  times  afterwards.  Initially,  reticulocyte  lysates  were 
fractionated  on  DEAE-cellulose  into  two  crude  fractions:  frac¬ 
tion  1,  which  was  not  adsorbed,  and  fraction  2,  which  con¬ 
tained  all  proteins  adsorbed  to  the  resin  and  eluted  with  high 
salt.  The  original  aim  of  his  fractionation  had  been  to  remove 
hemoglobin  (present  in  fraction  1),  but  we  found  that  fraction 
2  lost  most  of  ATP-dependent  proteolytic  activity.  Activity 
could  be  restored  by  combining  fractions  1  and  2.  The  active 
component  in  fraction  1  was  a  small  protein  that  we  purified 


Fiq.  1.  Discovery  of  the  llqa- 

tion  of  ubiquitin  to  lysozyme,  ^  j  ^  o  o  / 

a  substrate  of  the  proteolytic 
system.  Reaction  products 
were  separated  by 
SDS-PAGE.  Lane  1,  incuba¬ 
tion  of  *^^l-labeled  ubiquitin 
with  fraction  1  in  the  absence 
of  ATP;  ubiquitin  remains  free 
and  migrates  at  the  front. 

Lanes  2-5,  incubation  of  ^^^1- 
labeled  ubiquitin  with  frac¬ 
tion  1  in  the  presence  of  ATP. 

Lane  2,  ubiquitin  becomes 
covalently  linked  to  many 
high-molecular-weight  deriv¬ 
atives,  presumably  endoge¬ 
nous  protein  substrates 
present  in  fraction  2.  Lanes 
3-5,  several  new  labeled 
bands  appear  (C1-C5), 
which  increase  with  increas¬ 
ing  concentrations  of 

lysozyme.  Lanes  6  and  7,  incubation  of  *^^l-labeled  lysozyme  with  fraction  2 
in  the  absence  of  ATP  (lane  6)  or  with  ATP  and  unlabeled  ubiquitin  (lane  7); 
bands  C1-C5  contain  the  label  of  *^^l-labeled  lysozyme  and  consist  of  in¬ 
creasing  numbers  of  ubiquitin  molecules  ligated  to  lysozyme.  Reproduced 
from  ref.  1 3,  with  permission. 
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Fig.  2.  The  ubiquitin  pathway,  then  and  now.  o.  In  the  original 
model,  the  following  enzymatic  steps  were  proposed:  1,  'APF-1- 
protein  amide  synthetase'  ligates  n  molecules  of  APF-1 -ubiquitin 
to  Lys  e-amino  groups  of  the  protein  substrate.  2,  A  'correction' 
amidase  (isopeptidase)  releases  free  protein  and  APF-1 -ubiquitin 
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from  erroneous  ligation  products.  3,  An  endopeptidase  (protease)  specifically  acts  on  proteins  ligated  to  several  molecules  of  APF-1  and  cleaves  pep¬ 
tide  bonds  with  the  liberation  of  APF-1  still  linked  to  Lys  or  a  Lys-containing  peptide  (APF-1 -X).  4,  Amidase  (isopeptidase)  cleaves  the  bond  between 
APF-1  and  the  e-amino  group  of  Lys  residues  and  thus  liberates  reusable  APF-1 -ubiquitin.  Reproduced  from  ref.  1 3,  with  permission,  b,  Current  infor¬ 
mation  on  the  enzymatic  reactions  of  the  ubiquitin  system.  Steps  1,  2  and  3,  accomplished  by  El,  E2  and  E3,  correspond  to  step  1  of  the  original 
model.  Step  4,  accomplished  by  the  26S  proteasome,  corresponds  to  step  3  of  the  original  model.  Steps  5,  6  and  7,  accomplished  by  ubiquitin-car- 
boxy-terminal  hydrolases  (isopeptidases),  correspond  to  steps  2  and  4  of  the  original  hypothesis.  Reproduced  from  ref.  15,  with  permission. 


by  taking  advantage  of  its  remarkable  stability  to  heat  treat¬ 
ment®.  It  was  first  called  APF-1,  for  ATP-dependent  proteolysis 
factor  1.  The  identification  of  APF-1  as  ubiquitin  was  later  made 
by  Wilkinson  and  co-workers^,  after  our  discovery  of  its  ligation 
to  proteins.  Ubiquitin  was  first  thought  to  be  a  thymic  hor¬ 
mone,  but  subsequently  was  found  to  be  present  in  many  tis¬ 
sues  and  organisms,  hence  its  name^°.  It  was  found  to  be 
conjugated  to  histone  2A  (ref.  11),  but  its  functions  remained 
unknown.  Although  we  did  not  know  at  that  time  that  APF-1 
was  ubiquitin,  I  will  use  the  term  ubiquitin  here  to  facilitate  the 
discussion. 

The  purification  of  ubiquitin  from  fraction  1  was  the  key  to 
the  elucidation  of  the  mode  of  its  action  in  the  proteolytic  sys¬ 
tem.  At  first  I  thought  that  it  could  be  an  activator,  or  a  regula¬ 
tory  subunit  of  a  protease  or  other  enzyme  component  of  the 
system  present  in  fraction  2.  To  examine  this  possibility,  puri¬ 
fied  ubiquitin  was  radioiodinated  and  incubated  with  crude 
faction  2  in  the  presence  or  absence  of  ATP.  There  was  substan¬ 
tial  ATP-dependent  binding  of  ‘^T-labeled  ubiquitin  to  high- 
molecular-weight  proteins  by  gel  filtration  chromatography^^. 
However,  a  covalent  amide  linkage  was  unexpectedly  formed, 
as  shown  by  the  stability  of  the  'complex'  to  treatment  with 
acid,  alkali,  hydroxylamine  or  boiling  with  SDS  and  mercap- 
toethanoP^.  Analysis  of  the  reaction  products  by  SDS-PAGE 
showed  that  ubiquitin  was  ligated  to  many  high-molecular- 
weight  proteins.  Because  crude  fraction  2  from  reticulocytes 
contains  not  only  enzymes  but  also  endogenous  substrates  of 
the  proteolytic  system,  we  began  to  suspect  that  ubiquitin 
might  be  linked  to  protein  substrates,  rather  than  to  an  en¬ 
zyme.  In  support  of  this  interpretation,  we  found  that  proteins 
that  are  good  (although  artificial)  substrates  for  ATP-dependent 
proteolysis,  such  as  lysozyme,  form  several  conjugates  with 
ubiquitin In  the  original  experiment  that  convinced  us  that 
ubiquitin  is  ligated  to  the  protein  substrate,  similar  high-mole¬ 
cular-weight  derivatives  were  formed  when  ^^T-labeled  ubiqui¬ 
tin  was  incubated  with  unlabeled  lysozyme  (Fig.  1,  lanes  3~5), 
and  when  ^^^I-labeled  lysozyme  was  incubated  with  unlabeled 
ubiquitin  (Fig.  1,  lane  7).  Analysis  of  the  ratio  of  radioactivity 
in  ubiquitin  and  lysozyme  indicated  that  the  various  deriva¬ 
tives  consisted  of  increasing  numbers  of  ubiquitin  molecules 
linked  to  one  molecule  of  lysozyme.  On  the  basis  of  these  find¬ 


ings,  we  proposed  a  model  in  1980  (Fig.  2a)  in  which  several 
molecules  of  APF-l-ubiquitin  are  linked  to  Lys  e-amino  groups 
of  the  protein  substrate  by  an  'APF- 1-protein  amide  synthetase' 
(Fig.  2a,  step  1).  We  proposed  that  proteins  ligated  to  several 
ubiquitins  were  broken  down  by  a  specific  protease  that  recog¬ 
nizes  such  conjugates  (Fig.  2a,  step  3).  Thus,  the  protein  would 
be  broken  down  to  free  amino  acids  and  to  APF-l-ubiquitin 
still  linked  by  isopeptide  linkage  to  Lys  or  a  small  peptide  (APF- 
1-X).  Finally,  free  APF-l-ubiquitin  is  released  for  re-use  by  the 
action  of  a  specific  amidase/isopeptidase  (Fig.  2a,  step  4).  Based 
on  a  suggestion  by  Ernie  Rose,  we  added  a  hypothetical  'cor¬ 
recting'  isopeptidase  to  this  scheme,  which  would  release  free 
ubiquitin  and  substrate  protein  from  products  of  erroneous 
ubiquitin-protein  ligation  (Fig.  2a,  step  2).  An  isopeptidase 
that  may  have  such  correction  function  was  described  re- 
cently^^. 

Comparison  of  the  original  model  with  our  current  knowl¬ 
edge  of  the  reactions  of  the  ubiquitin  pathway^ ^  (Fig.  21?)  shows 
that  the  original  model  was  essentially  correct,  but  much  fur¬ 
ther  detail  provides  explanation  for  the  high  selectivity  of  ubiq- 
uitin-mediated  protein  degradation.  Thus,  we  have  found  that 
'APF-1 -protein  amide  synthetase'  is  actually  composed  of  three 
types  of  enzymes:  a  ubiquitin-activating  enzyme  El,  a  ubiqui- 
tin-carrier  protein  E2  and  a  ubiquitin-protein  ligase  E3  (ref.  16). 
Specific  E3  enzymes  recognize  specific  structural  features  in 
specific  protein  substrates,  and  thus  account  for  substrate  selec- 
tivity^L  Proteins  ligated  to  multi-ubiquitin  chains  are  degraded 
by  a  26S  proteasome  complex  discovered  by  Rechsteiner  and 
co-workers^®.  ATP  is  needed  not  only  for  the  ubiquitin-protein 
ligation  reaction,  as  originally  proposed,  but  also  for  the  action 
of  the  26S  proteasome^®.  Finally,  free  and  reusable  ubiquitin  is 
released  by  the  action  of  a  large  variety  of  ubiquitin  C-terminal 
hydrolases^^ 

As  indicated  before^^  the  main  lesson  from  our  story  is  the 
continued  importance  of  the  use  of  biochemistry  in  modern 
biomedical  research.  Without  biochemistry,  it  is  doubtful 
whether  an  entirely  new  system  could  have  been  discovered. 
On  the  other  hand,  molecular  genetics  has  been  essential  in 
discovering  the  many  functions  of  this  system  in  processes 
such  as  cell  cycle  control,  signal  transduction  and  the  immune 
response^^ 
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The  ubiquitin-proteolytic  pathway:  From  obscurity  to  the  patient  bed 


Traditionally,  researchers  in  the  field  of  terial  deca-peptide  antibiotic  gramicidin  S 

proteolysis  have  tried  to  characterize  and  AaRON  CiECHANOVER  (ref.  25).  The  similarities  between  the 


purify  a  single  protease  while  pursuing 
the  activity  they  are  studying.  1  used  a  similar  approach  at  the 
beginning  of  my  studies  as  a  graduate  student  with  Avram 
Hershko,  when  we  initiated  our  efforts  to  purify  the  non-lysoso- 
mal,  ATP-dependent  proteolytic  activity  that  Etlinger  and 
Goldberg^  and  we^®  had  characterized  earlier  in  reticulocyte 
lysate.  An  obvious  first  step  during  the  purification  of  proteins 
from  red  blood  cells  is  to  remove  hemoglobin,  the  main  protein 
in  the  extract,  by  anion  exchange  chromatography.  When  we 
fractionated  the  lysate,  we  could  not  recover  the  proteolytic  ac¬ 
tivity  in  either  the  flow-through  material  that  contained  hemo¬ 
globin  (fraction  I)  or  in  the  adsorbed,  high-salt-desorbed 
material  (fraction  II).  The  proteolytic  activity  could  be  recov¬ 
ered,  however,  after  the  addition  of  both  fractions  I  and  II  to  the 
reaction  mixture®  (Table  1).  This  was  a  critical  experiment,  as  it 
taught  us  that  this  complex  system  contained  at  least  two  (and 
if  two,  possibly  more,  as  was  later  shown  to  be  true)  comple¬ 
mentary  factors,  and  not  a  single,  'traditionaT  protease  that  was 
also  ATP-dependent.  From  then  on,  we  used  the  power  of  'clas¬ 
sical'  biochemistry  and  a  'complementing-add-and-subtract'  ap¬ 
proach  to  initially  purify  from  fraction  I  ATP-dependent 
proteol5Tic  factor-1  (APF-1;  ref.  21).  We  called  the  protein  APF-1, 
as  it  was  the  first  factor  we  characterized.  At  the  same  time  we 
started  to  identify  additional  complementing  factors,  and 
looked  for  a  simple  terminology  to  enable  convenient  commu¬ 
nication  in  the  laboratory. 

APF-1  was  later  identified  as  ubiquitin^^  a  known  protein  of 
previously  unknown  function.  Keith  Wilkinson  and  Arthur  Haas 
were  post-doctoral  fellows  with  Irwin  A.  Rose  at  the  Fox  Chase 
Cancer  Center  in  Philadelphia,  where  Avram  Hershko  spent  a 
sabbatical  in  1977-1978  and,  later,  his  summers.  I  joined  him 
during  the  summers  of  1978-1981.  Wilkinson  and  Haas  were  fas¬ 
cinated  by  the  new  type  of  post-translational  modification  by 
APF-1  and,  along  with  Michael  Urban,  who  worked  in  a  neigh¬ 
boring  laboratory  studying  histones,  identified  APF-1  as  ubiqui- 
tin.  It  was  known  that  ubiquitin  generates  a  single  modified 
adduct  with  histones  H2A  and  H2B;  however,  the  function  of 
these  adducts  has  unexpectedly  remained  obscure  to  these  very 
days.  We  adopted  the  existing  terminology,  and  APF-1  became 
ubiquitin.  In  parallel,  we  showed  that  multiple  molecules  of  the 
protein  are  conjugated  covalently  to  the  target  substrate  and  sug¬ 
gested  that  they  serve  as  a  degradation  signaP^'^®.  Initially  we 
thought  that  each  ubiquitin  moiety  attaches  to  a  single  internal 
Lys  in  the  target  molecule.  Later  Hershko^®  and  then 
Varshavsky^'^  and  their  colleagues  demonstrated  the  formation  of 
a  poly-ubiquitin  chain  anchored  to  a  single  internal  Lys  residue. 
In  the  conjugation  reaction,  the  C-terminal  group  of  ubiquitin 
(Gly)  generates  a  high-energy,  isopeptide  bond  with  an  e-amino 
group  of  an  internal  Lys  residue  of  the  substrate.  Later  we 
showed  that  the  first  conjugation  event  can  also  involve  the  N- 
terminus  residue  of  the  protein.  We  then  went  on  to  dissect  the 
mechanism  that  underlies  the  activation  of  ubiquitin  that  must 
precede  the  generation  of  the  high-energy  bond  with  the  target 
protein.  Here,  we  made  use  of  the  known  mechanism  of  amino- 
acid  activation  during  protein  synthesis,  but  mostly  of  the  mech¬ 
anism  shown  by  Fritz  Lipmann  and  his  colleagues  when  they 
reconstituted  a  cell-free,  non-ribosomal  biosynthesis  of  the  bac- 


three  mechanisms  of  activation  of  animo 
acids  for  protein  and  peptide  synthesis  and  ubiquitin  were  strik¬ 
ingly  similar.  Dissection  of  the  activation  mechanism  of  ubiqui¬ 
tin  paved  the  road  to  purification,  by  reaction-based,  'covalent' 
affinity  chromatography  over  immobilized  ubiquitin,  of  the 
three  sequentially  acting,  conjugating  enzymes:  the  ubiquitin- 
activating  enzyme  El,  the  ubiquitin  carrier  protein  E2  and  the 
ubiquitin  protein  ligase  E3,  which  accomplishes  the  last  and 
most-essential  step  in  the  conjugation  reaction,  specific  ligation 
of  ubiquitin  to  the  target  protein^®'^®.  In  1980,  before  the  identifi¬ 
cation  of  APF-1  as  ubiquitin  and  the  dissection  of  the  conjuga¬ 
tion  mechanism,  we  had  already  proposed  a  modeP®  (Fig.  2a) 
according  to  which  conjugation  of  multiple  APF-1  molecules  tar¬ 
gets  the  substrate  to  degradation  by  an  unknown,  yet  conjugate- 
specific,  downstream  protease.  Intact  APF-1  that  can  be  re-used  is 
recycled  through  the  activity  of  isopeptidases.  The  protease,  the 
26S  proteasome  complex,  was  characterized  and  purified  later  by 
Martin  Rechsteiner  and  colleagues^®.  In  1982,  a  more-detailed 
model  was  described,  and  the  system  began  to  be  placed  in  its  ap¬ 
propriate  biological  context^.  In  particular,  its  functions  were  an¬ 
alyzed  and  compared  to  those  of  the  lysosomes  involved  mostly 
in  the  degradation  of  extracellular  proteins  taken  up  through 
pinocytosis  and  receptor-mediated  endocytosis^. 

The  physiological  relevance  of  our  initial  findings  remained 
unknown,  as  until  that  time  (1981)  we  did  all  our  studies  in  a 
cell-free  reconstituted  system,  using  secretory  (and  not  intracel¬ 
lular;  as  these  were  available  in  large  amounts  and  low  cost)  pro¬ 
teins  as  model  substrates.  The  first  evidence  that  the  system  is 
involved  in  degradation  of  proteins  in  vivo  came  from  immuno¬ 
chemical  analysis  of  ubiquitin  adducts  in  cells^^.  Using  antibod¬ 
ies  raised  against  ubiquitin,  we  showed  that,  after  incubation  of 
cells  in  the  presence  of  amino-acid  analogs,  the  resulting  abnor¬ 
mal  proteins  are  short-lived.  Their  rapid  degradation  is  accompa¬ 
nied  by  a  transient,  yet  substantial  increase  in  the  level  of 
ubiquitin  adducts,  strongly  indicating  that  they  serve  as  essential 
intermediates  in  the  proteolytic  process.  Later,  stronger  and 
more-direct  proof  of  the  involvement  of  the  ubiquitin  system  in 
the  degradation  cellular  proteins  came  from  the  observation 
Alexander  Varshavsky,  Daniel  Finely  and  I  made  that  a  cell-cycle 
arrest  mutant  that  contains  a  thermolabile  El  enzyme  is  also  de¬ 
fective  in  the  degradation  of  short-lived  abnormal  proteins  at  the 
non-permissive  temperature^®'^^. 

An  important  yet  unresolved  problem  at  that  time  involved 
the  identification  of  the  specific  signaling  motifs  that  target  pro¬ 
teins  for  degradation.  My  initial  entry  into  this  fascinating  area 


Table  1  Anion  exchange  chromatograph ical  resolution  of 
reticulocyte  lysate  into  unadsorbed  (fraction  I),  and  adsorbed,  high- 
salt-desorbed  (fraction  II)  complementing  proteolytic  activities. 


Enzyme  fraction  used  Percent  degradation  of  labeled  globin 


(-)ATP 

(+)ATP 

Complete  lysate 

1.5 

10.0 

Fraction  1  (flow-through  material) 

0 

0 

Fraction  11  (high  salt  desorbed  eluate) 

1.5 

2.7 

Fraction  1  +  Fraction  II 

1.6 

10.6 

Adapted  with  modifications  from  ref.  8. 
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Pathogenesis  of  Ubiquitin  System-Related  Diseases 


Disease  Normal  function 


Disease 


was  unanticipated.  A  large-scale  purification  of  APF-1  carried  out 
in  Haifa  in  1980  showed  a  substantial  discrepancy  between  its 
high  dry  weight  and  low  protein  content,  as  determined  by  a 
Lowry  assay.  Hershko  and  I  thought  that  the  protein  was  proba¬ 
bly  a  ribonucleoprotein  complex,  and  the  excess  non-protein 
mass  was  due  to  the  RNA  component.  Treatment  of  the  APF-1 
preparation  with  RNase  A  led  to  abrogation  of  its  stimulatory  ac¬ 
tivity  towards  bovine  serum  albumin  (BSA),  but  not  lysozyme. 
Not  appreciating  at  that  time  the  extent  of  the  high  specificity  of 
the  system  toward  its  different  substrates,  we  could  not  explain 
the  'selective'  RNase  effect,  and  did  not  pursue  the  study.  The 
discrepancy  between  weight  and  protein  measurement  was  re¬ 
solved  later  with  the  finding  that  ubiquitin  has  a  low  content  of 
the  aromatic  amino  acids  that  are  the  basis  of  every  known 
method  for  protein  measurement.  Obviously,  ubiquitin  is  a  pure 
protein  and  not  a  ribonucleoprotein  complex.  Later,  our  studies 
showed  that  nuclease  added  to  the  ubiquitin  preparation  de¬ 
stroyed  tRNA^'®  that  was  necessary,  along  with  arginyl-tRNA  pro¬ 
tein  transferase,  to  convert  the  N-terminal  acidic  (Asp)  residue  of 
BSA  to  Arg  (refs.  30,31).  Only  the  modified  BSA  and  not  the  wild- 
type  BSA  can  bind  to  the  'basic'  N-terminal  binding  site  of  E3a, 
the  ubiquitin  ligase  involved  in  recognition  via  the  N-terminal 
residue  (the  N-end  rule  pathway  ligase;  ref.  15  and  see  below 
commentary  by  A.  Varshavsky).  Lysozyme,  with  a  Lys  at  the  N- 
terminal  residue,  does  not  undergo  this  post-translational  modi¬ 
fication  and  is  recognized  directly  by  E3a,  and  its  degradation, 
therefore,  is  not  sensitive  to  RNase.  This  finding  became  part  of 
the  more  thorough  and  systematic  mode  of  recognition  identi¬ 
fied  through  genetic  tools  by  Alexander  Varshavsky  and  his  col¬ 
leagues,  and  is  known  as  the  N-end-rule  pathway^^.  Earlier, 
Hershko  also  noted  the  importance  of  an  exposed  N-terminal 
residue  in  targeting  certain  model  proteins  for  degradation^^  but 
at  that  time,  the  mechanistic  relevance  of  this  finding  was  not 
apparent.  Although  most  known  substrates  of  the  ubiquitin  sys¬ 
tem  are  targeted  through  different  recognition  motifs^^  the  N- 
end-rule  pathway  was  the  first  well-defined  signal  of  substrate 
recognition  (see  commentary  by  A.  Varshavsky). 

As  for  the  myriad  cellular  substrates  and  regulatory  functions 
of  the  ubiquitin  system  now  known,  it  was  only  in  the  early 
1990s  that  scientists  started  to  unravel  these  secrets.  The  discov¬ 
ery  of  oncoproteins,  tumor  suppressors,  transcriptional  factors 
and  cell-cycle  regulators  have  shown  that  all  these  proteins  are 
short-lived  and  their  stability  is  tightly  regulated.  This  previously 
unknown  mode  of  regulation  through  destruction,  which,  un¬ 
like  phosphorylation,  is  irreversible  and  may  have  evolved  to  se¬ 
cure  directionality,  has  attracted  many  scientists  to  study  the 


Fig.  3  Accelerated  or  decreased  rates  of  ubiqultin-mediated  proteolysis 
can  underlie  the  pathogenesis  of  human  diseases.  Protein  substrates  are 
degraded  each  at  a  distinct  and  specific  rate  that  may  vary  in  different  con¬ 
ditions  (blue  arrows),  and  maintain  a  steady-state  level  (green  boxes)  that 
enables  them  to  function  properly.  Accelerated  degradation  (thick  red 
arrow),  as  occurs  in  HPV  E6-targeted  degradation  of  p53,  results  in  a  low 
steady-state  level  of  the  target  substrate  (small  beige  box)  and  exposure  of 
the  cell  to  malignant  transformation.  Decreased  degradation  (narrow  red 
arrow  with  a  blue  X)  can  occur  when  the  signaling  motif  in  the  substrate  is 
mutated  (mutations  in  the  phosphorylation  sites  in  p-catenin  in  certain 
cases  of  malignant  melanoma  or  colorectal  carcinoma,  or  mutations  in  the 
NEDD4  E3  recognition  motif  of  the  kidney  epithelial  sodium  channel  in 
Liddle  hypertension  syndrome);  or  when  E3  is  mutated  (as  in  Angelman 
syndrome,  in  which  E6-AP  is  mutated).  In  all  these  examples,  the  excess  ac¬ 
cumulated  substrate  (large  beige  box)  is  'toxic'. 


systems  and  signals  that  govern  the  stability  (and  hence  the  ac¬ 
tivity)  of  many  different  proteins^^'^^.  Without  even  knowing  the 
underlying  mechanisms,  Varshavsky,  Finley  and  I  predicted  that 
the  ubiquitin  system  may  be  involved  in  the  regulation  of  the 
cell  cycle^®.  The  prediction  was  based  on  the  observation  that  the 
ts85  mutant  cell  is  defective  in  both  El  that  inactivates  the  ubiq¬ 
uitin  system  and  in  transition  along  the  S/G2  boundary  of  the 
cell  cycle.  This  prediction  has  been  corroborated  by  many  studies 
demonstrating  involvement  of  the  ubiquitin  system  in  pro¬ 
grammed  degradation  of  a  broad  array  of  cell  cycle  regulators. 

Given  the  many  processes  and  substrates  involved,  recent  in¬ 
dications  of  the  involvement  of  the  system  in  the  pathogenesis 
of  many  inherited  as  well  as  acquired  diseases  are  not  unex¬ 
pected.  Drug  companies  are  trying  to  target  the  aberrations  in 
the  system  that  underlie  these  pathologies.  The  common  de¬ 
nominator  shared  by  all  these  diseases  is  a  change  in  the  steady- 
state  level  of  a  particular  protein  substrate  or  set  of  protein 
substrates.  In  general,  the  diseases  belong  to  two  classes,  result¬ 
ing  from  accelerated  or  decreased  rates  of  degradation  of  differ¬ 
ent  substrates  (Fig.  3).  The  second  class  can  be  further  divided 
into  diseases  due  to  mutations  in  enzymes  of  the  system  or  to 
mutations  in  recognition  motifs  of  substrates  (Fig.  3).  Although 
it  is  not  possible  to  systematically  review  here  all  these  diseases 
(reviewed  recently  in  ref.  39),  a  few  salient  examples  follow.  An 
example  of  accelerated  degradation  involves  the  degradation  of 
p53  induced  by  the  human  papillomavirus  (HPV)  E6  oncopro¬ 
tein,  which  probably  underlies  the  pathogenesis  of  human  uter¬ 
ine  cervical  carcinoma,  a  very  prevalent  and  severe  malignant 
disease.  E6  associates  with  p53  and  targets  it  for  rapid  degrada¬ 
tion  mediated  by  the  ubiquitin  ligase  E6-associated  protein  (E6- 
AP),  which  does  not  recognize  the  free  tumor  suppressor. 
Inactivation  of  the  cellular  DNA  damage-control  machinery  ex¬ 
poses  the  cell  to  malignant  transformation.  Mutations  in  E6-AP 
lead  to  Angelman  syndrome,  an  inherited  disease  associated 
with  severe  mental  retardation  and  motor  disorders.  Here,  accu¬ 
mulation  of  unidentified  native  substrate(s)  of  E6-AP  (p53  is  not 
the  native  substrate  of  the  enzyme  and  is  normally  targeted  by 
Mdm2)  is  probably  toxic  to  the  developing  brain.  In  another  ex¬ 
ample,  mutations  in  the  phosphorylation-targeting  motif  of  the 
transcription  factor  p-catenin,  or  in  adenomatosis  polyposis  coli 
(APC)  that  is  part  of  the  P-catenin  degradation  complex,  lead  to 
stabilization  and  accumulation  of  the  protein,  accompanied  by 
its  uncontrolled  activity.  These  mutations  may  be  involved  in 
the  pathogenesis  of  many  forms  of  colorectal  carcinomas  and 
malignant  melanomas.  Viruses  such  as  HPV  have  evolved  differ¬ 
ent  mechanism  that  enable  them  to  evade  the  normal  mode  of 
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Simultaneous  beneficial  and  harmful  effects  of  drug  targeting  of  the  ubiquitin  ligase,  E3 


Constitutive  degradation  of  the 
phosphorylated  substrate 


GSK3B  (+) 


Resting 

cel! 


t 


Constitutive  stabilization 
of  the  non-phosphorylated 
substrate 


IKK  (-) 


Fig.  4  Drug  targeting  of  an  E3  enzyme  can  become 
a  'double-edged  sword'.  In  the  resting  cell  (top), 

IkBo  (blue  arrow  and  beige  box,  upper  right)  Is  not 
phosphorylated  and  degraded  slowly,  whereas  p- 
catenin  is  phosphorylated  constitutively  by  GSK3P 
and  is  degraded  rapidly  following  ubiquiti nation  by 
the  -E3-ubiquitin-ligase  complex  (heavy  red 

arrow  and  light  blue  box,  upper  left).  In  the  signaled 
cell  (bottom),  IkB(x  is  phosphorylated  by  IkB  kinase 
(IKK),  rapidly  ubiquitinated  by  the  same  SCF-TrCP 
complex  and  degraded  (thick  red  arrow  and  light  blue 
box,  lower  right).  Also,  in  signaled  cells,  GSK3P  is  in¬ 
hibited,  and  the  non-phosphorylated  P-catenin  is  sta¬ 
bilized,  translocated  into  the  nucleus  and  stimulates 
transcription  (blue  arrow  and  beige  box,  lower  left).  In 
general,  kB-containing  cells  are  distinct  from 
p-catenin-containing  cells  and  so  are  the  signals  that 
activate  the  two  pathways.  However,  the  phosphory¬ 
lated  recognition  motifs  of  the  two  proteins  are  similar  and  they  seem  to  be  targeted  by  the  same  TrCP  ubiquitin  ligase.  An  E3  inhibitor  (inhibitory  drug,  ID) 
can  lead  to  inhibition  of  degradation  of  IkB  in  stimulated  cells,  and  consequently  to  suppression  (beneficial  effect)  of  NF-KB-induced  inflammatory  processes 
that  may  occur  in  autoimmune  diseases,  for  example.  At  the  same  time,  ID  treatment  will  result  in  suppression  of  degradation  of  phosphorylated  p-catenin 
in  resting  cells,  with  resultant  accumulation  of  the  transcription  factor  and  possible  subsequent  malignant  transformation  (harmful  effect). 


activity  of  the  ubiquitin  system  and  allow  them  to  continue  their 
replication  and  propagation.  Epstein  Barr  nuclear  antigen  1 
(EBNA-1)  persists  in  healthy  carriers  for  life,  and  its  persistence 
contributes  to  some  of  the  virus-related  pathologies.  Unlike  all 
other  Epstein  Barr  viral  proteins,  EBNA-1  cannot  elicit  a  cy¬ 
totoxic  T  lymphocyte  (CTL)  response.  A  long,  C-terminal  Gly-Ala 
repeat  inhibits  ubiquitin-mediated  degradation  and  subsequent 
major  histocompatibility  complex  (MHC)  class  I  antigen  presen¬ 
tation  of  EBNA-1.  The  human  cytomegalovirus  (CMV)  encodes 
two  endoplasmic  reticulum  (ER)  resident  proteins,  US2  and 
USll,  that  bind  to  MHC  class  I  molecules  in  the  ER  and  escort 
them  to  the  translocation  machinery.  After  retrograde  transport 
to  the  cytoplasm,  they  are  ubiquitinated  and  degraded  by  the 
proteasome.  Removal  of  the  MHC  molecules  enables  the  virus  to 
evade  the  immune  system.  A  completely  different  case  involves 
Liddle  syndrome.  In  this  disorder,  a  mutation  in  the  recognition 
motif  that  targets  the  kidney  epithelial  sodium  channel  (ENaC). 
to  ubiquitination  by  the  Nedd4  E3  leads  to  accumulation  of  the 
channel,  excessive  reabsorption  of  sodium  and  water,  with  re¬ 
sulting  severe  hypertension. 

Because  of  the  central  function  of  the  ubiquitin  system  in 


many  basic  cellular  processes,  development  of  drugs  that  modu¬ 
late  the  system  may  be  difficult.  Inhibition  of  enzymes  common 
to  the  entire  pathway,  such  as  the  proteasome,  may  affect  many 
processes  nonspecifically,  although  a  narrow  'window'  between 
beneficial  effects  and  toxicity  can  be  identified  for  a  short-term 
treatment.  An  attractive  possibility  is  the  development  of  small 
molecules  that  inhibit  specific  E3  molecules.  For  example,  spe¬ 
cific  phospho-peptide  derivatives  can  inhibit  the  p-TrCP  ubiqui¬ 
tin  ligase,  E3  complex  (P-Transducin  repeat-Containing 
Protein^®).  However,  this  approach  can  turn  into  a  'double- 
edged  sword'  (Fig.  4).  Ideally,  small  molecules  should  be  devel¬ 
oped  that  bind  to  specific  substrates  or  to  their  ancillary 
proteins,  and  thus  inhibit  a  specific  process.  Peptide  aptamers 
(small  molecules/peptides  that  bind  to  active/association  sites 
of  proteins  and  inhibit  their  native  interactions)  that  bind 
specifically  to  HPV  E6  and  probably  prevent  its  association  with 
p53,  have  been  shown  to  induce  apoptosis  and  reverse  certain 
malignant  characteristics  in  HPV-transformed  cells,  probably  by 
interfering  with  p53  targeting^ k  Unfortunately,  because  of  the 
rarity  of  proteins  targeted  by  similar  mechanisms,  this  approach 
may  be  currently  limited  to  a  small  number  of  cases. 


Discovering  the  functions  and  degrons  of  the  ubiquitin  system 


Through  preparation,  help  from  friends 
and  a  lot  of  luck  I  was  able  to  leave  the 
former  Soviet  Union  in  the  fall  of  1977, 
and  ended  up  in  Boston.  A  month  later  I  was  a  faculty  member 
of  the  Biology  Department  of  the  Massachusetts  Institute  of 
Technology  (MIT),  before  I  knew  what  exactly  grants  were  (and 
before  the  colleagues  who  hired  me  became  aware  of  that  fact). 
In  Moscow,  I  studied  chromosome  structure  and  regulation  of 
gene  expression,  and  looked  forward  to  continuing  this  work. 

There  were  few  similarities  between  my  earlier  milieu  and  the 
astonishing  new  life.  The  libraries  were  one  of  them.  They  were 
just  as  quiet  and  pleasant  in  Cambridge  as  in  Moscow,  and  a  li¬ 
brary  at  MIT  soon  became  my  second  home.  Reading  there  I 
came  across  a  curious  1977  paper  by  Harris  Busch,  Ira  Godknopf 
and  their  colleagues.  They  found  a  DNA-associated  protein  that 
had  one  C-terminus  but  two  N-termini,  an  unprecedented  struc¬ 
ture.  The  short  arm  of  that  Y-shaped  protein  was  joined. 


through  its  C-terminus,  to  an  internal  Lys 
of  histone  H2A.  The  short  arm  was  soon 
identified,  by  Margaret  Dayhoff,  as  ubiq¬ 
uitin,  a  76-residue  protein  of  unknown  function  that  was  de¬ 
scribed  (as  a  free  protein)  by  Gideon  Goldstein  and  colleagues  in 
1975  (ref.  10). 

I  became  interested  in  this  first  ubiquitin  conjugate,  UbH2A. 
Back  in  Russia,  I  had  begun  to  develop  a  method  for  high-resolu¬ 
tion  analysis  of  nucleosomes.  These  DNA-protein  complexes 
were  subjected  to  electrophoresis  in  a  low-ionic-strength  poly¬ 
acrylamide  gel  (a  forerunner  of  the  gel-shift  assay),  followed  by 
second-dimension  electrophoresis  of  either  DNA  or  proteins.  We 
located  UbH2A  in  a  subset  of  the  nucleosomes,  succeeded  in  sep¬ 
arating  these  nucleosomes  from  those  lacking  UbH2A,  and  even¬ 
tually  showed  that  UbH2A-containing  nucleosomes  were 
enriched  in  transcribed  genes  and  excluded  from  the  inactive 
(heterochromatic)  parts  of  the  chromosomes^. 


Alexander  Varshavsky 
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Meanwhile,  Avram  Hershko,  his  graduate  student  Aaron 
Ciechanover  and  their  colleagues  at  the  Technion  (Haifa,  Israel) 
were  studying  ATP-dependent  protein  degradation  in  extracts 
from  rabbit  reticulocytes.  In  1978-1980  they  demonstrated  that 
a  small  protein,  which  they  called  APF-1  (ATP-dependent  prote- 
ol)d;ic  factor  1),  was  covalently  conjugated  to  proteins  that  were 
about  to  be  degraded  in  the  extract.  They  suggested  that  a  pro¬ 
tein-linked  APF-1  served  as  a  signal  for  a  downstream  protease, 
and  began  the  analysis  of  enzymology  of  APF-1  conjugation.  In 
1980,  Keith  Wilkinson,  Michael  Urban  and  Arthur  Haas  showed 
that  APF-1  and  ubiquitin  were  the  same  protein^. 

When  I  saw  that  1980  paper,  two  seemingly  independent 
realms,  protein  degradation  and  chromosomes,  came  together.  I 
realized  that  we  were  dealing  with  a  proteolytic  system  of  im¬ 
mense  complexity  and  exceptionally  broad,  still  to  be  discovered, 
range  of  functions.  I  decided  to  find  genetic  approaches  to  this 
entire  problem,  because  a  system  of  such  complexity  was  unlikely 
to  be  understood  through  biochemistry  alone.  In  1980,  reverse- 
genetic  techniques  were  about  to  become  feasible  with  the  yeast 
Saccharomyces  cerevisiae,  but  were  still  a  decade  away  from  mam¬ 
malian  genetics.  I  continued  to  read,  as  widely  as  I  could.  On  a 
fateful  day  at  the  end  of  1980, 1  came  across  a  paper  by  Yamada 
and  colleagues  that  described  a  conditionally  lethal,  temperature- 
sensitive  mouse  cell  line  called  ts85.  The  researchers  showed  that 
a  specific  nuclear  protein  disappeared  from  ts85  cells  at  increased 
temperatures,  and  suggested  that  this  protein  might  be  UbH2A. 
When  I  saw  their  data,  I  had  to  calm  down  to  continue  reading, 
because  I  knew  that  this  protein  was  UbH2A.  (In  the  preceding 
two  years  we  had  learned  much  about  the  electrophoretic  proper¬ 
ties  ofUbH2A.) 

Daniel  Finley  had  just  joined  my  lab  to  study  regulation  of  gene 
expression,  but  soon  switched  to  ts85  cells.  A  few  months  into  the 
project,  Finley  and  I  made  the  crucial  observation  that  ubiquitin 
conjugation  in  an  extract  from  ts85  cells  was  temperature-sensi¬ 
tive,  in  contrast  to  an  extract  from  parental  cells.  Soon  afterward, 
I  invited  Ciechanover,  who  came  from  the  Hershko  laboratory  for 
a  postdoctoral  stint  at  another  MIT  lab,  to  join  Finley  and  me  in 
the  continuing  study  of  ts85  cells.  He  did,  and  we  published  two 
papers  in  1984  that  demonstrated  two  main  results:  that  mouse 
ts85  cells  have  a  temperature-sensitive,  ubiquitin-activating  (El) 
enzyme,  and  that  these  cells  stop  degrading  the  bulk  of  their  nor¬ 
mally  short-lived  proteins  at  the  nonpermissive  tempera ture^®'^^. 
This  was  the  first  evidence  that  ubiquitin  conjugation  was  re¬ 
quired  for  protein  degradation  in  vivo.  These  findings^®'^^  also  indi¬ 
cated  that  ubiquitin  conjugation  was  essential  for  cell  viability.  In 


Fig.  5  The  ubiquitin  system  of  S.  cerevisia^\  The  yeast  ubiquitin  genes, 
two  of  which  {UBI1  and  UBf2)  contain  introns,  encode  fusion  proteins  of 
ubiquitin  (yellow  rectangles)  to  itself  {UBI4)  or  to  one  of  the  two  specific  ri- 
bosomal  proteins  {UBI1-UBI3)  (red  and  blue  rectangles).  These  fusion  pro¬ 
teins  are  cleaved  by  deubiquitinating  enzymes,  yielding  mature  ubiquitin. 
Thioester  bonds  between  ubiquitin  and  the  active-site  Cys  residues  of  ubiq- 
uitin-specific  enzymes.  The  conjugation  of  ubiquitin  to  other  proteins  in¬ 
volves  a  preliminary  ATP-dependent  step,  in  which  the  last  residue  of 
ubiquitin  (Gly76)  is  joined,  through  a  thioester  bond,  to  a  Cys  residue  in  the 
ubiquitin-activating  (El)  enzyme  encoded  by  UBAl.  The  activated  ubiquitin 
is  transferred  to  a  Cys  residue  in  one  of  at  least  1 3  distinct  ubiquitln-conju- 
gating  (E2)  enzymes  encoded  by  the  bBC  family  genes,  and  from  there  to  a 
Lys  residue  of  an  ultimate  acceptor  protein  (yellow  oval).  This  last  step  and 
the  formation  of  a  multi-ubiquitin  chain  (black  ovals)  require  participation 
of  another  component,  called  E3  (the  names  of  some  of  the  yeast  E3  pro¬ 
teins  are  included).  A  targeted,  ubiquitinated  protein  substrate  is  proces- 
sively  degraded  to  short  peptides  by  the  ATP-dependent  26S  proteasome. 
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addition,  ts85  cells  tended  to  be  arrested  at  the  G2  phase  of  the 
cell  cycle,  and  the  synthesis  of  heat-shock  proteins  was  strongly 
induced  in  these  cells  at  the  nonpermissive  temperature,  indicat¬ 
ing  that  ubiquitin-dependent  proteolysis  is  involved  in  the  cell- 
cycle  progression  and  stress  response^^  In  1983,  Tim  Hunt  and 
colleagues  discovered  unusual  proteins  in  rapidly  dividing  fertil¬ 
ized  clam  eggs.  These  proteins,  which  they  called  cyclins,  were  de¬ 
graded  at  the  exit  from  mitosis.  We  suggested  in  1984  that  cyclins 
were  destroyed  by  the  ubiquitin  system^^,  a  hypothesis  shown  to 
be  correct  by  Michael  Glotzer,  Andrew  Murray  and  Marc 
Kirschner  in  1991. 

The  ts85  results^®'^^  left  little  doubt,  among  the  optimists, 
about  the  importance  of  the  ubiquitin  system  in  cellular  physiol¬ 
ogy.  Unfortunately,  these  findings  could  not  be  deepened  and 
made  more  rigorous,  because  of  limitations  of  mammalian  so¬ 
matic  cell  genetics,  which  was  still  hampered  at  that  time  by  the 
impossibility  of  altering  genes  at  will.  Therefore,  in  1983  we 
began  systematic  analysis  of  the  ubiquitin  system  in  S.  cerevisiae 
(Fig.  5).  In  1984,  Finley  and  Engin  Ozkaynak  cloned  the  first 
ubiquitin  gene,  and  found  that  it  encoded  a  polyubiquitin  pre¬ 
cursor  protein.  By  1987,  they  showed  that  this  gene,  UBI4,  was 
strongly  induced  by  different  stresses.  Moreover,  deletion  of 
UBI4  resulted  in  cells  that  were  hypersensitive  to  every  noxious 
treatment  we  tried,  including  heat  and  oxidative  stress'^^.  These 
results  validated  and  extended  an  inference  from  the  1984  find¬ 
ings  with  ts85  cells,  thereby  establishing  one  broad  and  essential 
function  for  the  ubiquitin  system. 

In  a  parallel  1987  study,  Stefan  Jentsch  and  John  McGrath  iso¬ 
lated  ubiquitin-conjugating  (E2)  enzymes  from  S.  cerevisiae.  One 
evening,  a  phone  call  from  an  excited  Stefan  Jentsch  marked  the 
discovery  of  yet  another  function  of  the  ubiquitin  system:  a  par¬ 
tially  sequenced  yeast  E2  enzyme  was  found  to  be  RAD6,  a  pro¬ 
tein  known  to  yeast  geneticists  for  years  as  an  essential 
component  of  DNA  repair  pathways'^^.  RAD6  was  the  first  enzyme 
of  the  ubiquitin  system  that  was  shown  to  mediate  a  specific 
physiological  function.  The  sequence  of  RAD 6  was  weakly  similar 
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to  that  of  CDC34,  an  essential  cell-cycle  regulator  defined  geneti¬ 
cally  by  Leland  Hartwell.  In  1988,  a  collaboration  between  Breck 
Byer's  and  my  laboratories  demonstrated  that  CDC34  was  also  a 
ubiquitin-conjugating  enzyme'^'^  (Fig.  5).  This  result  transformed  a 
hint  from  our  ts85  work  into  a  definitive  demonstration  of  the  in¬ 
volvement  of  the  ubiquitin  system  in  cell-cycle  control. 

In  1989,  Finley  and  Bonnie  Bartel  discovered  that  ubiquitin 
genes  other  than  UBI4  (the  polyubiquitin  gene)  were  also  quite 
unusual:  UBI1-UBI3  encoded  fusions  of  ubiquitin  to  one  protein 
of  the  large  ribosomal  subunit  and  one  protein  of  the  small  ribo- 
somal  subunit,  an  arrangement  conserved  from  yeast  to  hu- 
mans'^'^  (Fig.  5).  Kenneth  Redman  and  Martin  Rechsteiner 
independently  identified  these  non-ubiquitin  extensions  as  ribo¬ 
somal  proteins.  The  transient  presence  of  ubiquitin  in  front  of  a 
ribosomal  protein  moiety  (ubiquitin  was  rapidly  cleaved  off  by 
deubiquitinating  enzymes)  was  found  to  be  essential  for  efficient 
biogenesis  of  the  ribosomes^^  Ubiquitin  acts,  in  these  settings, 
not  as  a  degradation  signal  but  as  a  molecular  chaperone.  The  fu¬ 
sion-imposed  1:1  molar  ratio  of  free  ubiquitin  to  a  free  ribosomal 
protein  (Fig.  5)  sets  an  upper  limit  for  the  number  of  newly  pro¬ 
duced  ribosomes  relative  to  the  number  of  newly  formed  ubiqui¬ 
tin  molecules.  This  tight  link,  through  DNA-encoded  fusions  of 
ubiquitin  and  ribosomal  proteins,  is  one  of  the  few  understood 
regulatory  interactions  between  protein  synthesis  and  protein 
degradation. 

The  enormous  expansion  of  the  ubiquitin  field  in  the  last 
decade  stemmed  mainly  from  these  functional  insights  of  the 
1980s,  which  demonstrated  both  the  involvement  of  ubiquitin 
conjugation  in  important  biological  processes  and  the  striking 
diversity  of  these  processes,  from  the  cell  cycle^’  to  DNA  re- 
pair'^^  ribosome  biogenesis^^  and  stress  responses^^.  Many  more 
functions  have  been  added  to  this  list  since  1990. 


Fig.  6  The  N-end  rule  pathway.  Notations  in  the  yeast  (a)  and  mouse  (b) 
pathways  show  type  1  (purple)  and  type  2  (red)  primary,  secondary  (light 
blue)  and  tertiary  (green)  destabilizing  N-terminal  residues;  yellow  ovals  indi¬ 
cate  the  rest  of  a  protein  substrate,  a,  The  in  vivo  half-lives  of  X-Pgals,  p-galac- 
tosidase-based  test  proteins  in  5.  cerevisiae"*^  (right).  X-pgal  proteins  bearing 
stabilizing  N-terminal  residues  (black)  are  metabolically  stable  (h/z,  more 
than  20  h).  The  tertiary  destabilizing  residues  N  (Asn)  and  Q  (Gin)  are  con¬ 
verted  into  secondary  destabilizing  residues  D  (Asp)  and  E  (Glu)  by  N-termi¬ 
nal  amidohydrolase  (Nt-amidase),  encoded  by  NTAh  D  and  E  are 
conjugated  to  R  (Arg),  one  of  the  primary  destabilizing  residues,  by  Arg-RNA 
protein  transferase  (R-transferase),  encoded  by  ATEh  b,  In  the  mammalian 
N-end  rule  pathway,  the  deamidation  step  is  mediated  by  two  distinct  en¬ 
zymes,  Nt'^-amidase  and  Nt^-amidase,  specific  for  N-terminal  Asn  and  Gin 
residues,  respectively  In  vertebrates,  the  set  of  secondary  destabilizing 
residues  contains  not  only  Asp  and  Glu  but  also  Cys  (C),  which  is  a  stabilizing 
residue  in  yeast“.  In  mammals  but  not  in  yeast,  Ala  (A),  Ser  (S)  and  Thr  (T) 
are  primary  (type  3)  destabilizing  residues'*®,  c,  5.  cerevisiae  UBRl  has  two 
binding  sites  for  the  primary  destabilizing  N-terminal  residues  of  either  pro¬ 
teins  or  short  peptides.  The  type  1  site  is  specific  for  basic  N-terminal  residues 
Arg,  Lys  and  His.  The  type  2  site  is  specific  for  bulky  hydrophobic  N-terminal 
residues  Phe,  Leu,  Trp,  Tyr  and  He.  UBRl  contains  yet  another  substrate-bind¬ 
ing  site  (i),  which  targets  proteins  bearing  internal  (non-N-terminal)  degrons. 
In  yeast,  these  proteins  include  the  CUP9  repressoi^^  A  complex  of  UBRl  and 
the  ubiquitin-conjugating  (E2)  enzyme  RAD6  produces  a  substrate-linked 
multi-ubiquitin  chain'*®. 


How  are  proteins  recognized  as  substrates  for  ubiquitin  conju¬ 
gation?  The  first  solution  to  this  problem  was  produced  in  1986, 
when  Andreas  Bachmair  and  Finley  discovered  the  first  degrada¬ 
tion  signals  in  short-lived  proteins'*^.  We  constructed  ubiquitin 
fusion  proteins  in  which  ubiquitin  was  followed  by  a  reporter 
moiety  such  as  Escherichia  coli  p-galactosidase,  and  expressed 
them  in  5.  cerevisiae.  The  first  advance  took  place  when  we 
learned  that  the  ubiquitin  moiety  of  these  fusion  proteins  was 
rapidly  removed  by  deubiquitinating  enzymes  regardless  of  the 
identity  of  the  residue  at  the  C-terminal  side  of  the  cleavage  site, 
with  Pro  being  the  sole  exception.  Thus  was  born  the  ubiquitin 
fusion  technique,  which  made  it  possible  to  place,  in  vivo,  any 
desired  residue  (except  Pro)  at  the  N-terminus  of  a  protein  of  in- 
teresU^  The  presence  of  Met  at  the  N-termini  of  nascent  proteins 
and  the  substrate  specificity  of  c5d;osolic  Met  aminopeptidases 
did  not  allow  this  level  of  experimental  freedom  before  the  dis¬ 
covery  of  the  ubiquitin  fusion  technique^^ 

Using  this  method,  Bachmair  and  Finley  discovered  that  the  in 
vivo  half-life  of  a  test  protein  was  strongly  dependent  on  the 
identity  of  its  N-terminal  residue,  a  simple  relation  called  the  N- 
end  rule'*^  The  underlying  ubiquitin-dependent  pathway,  called 
the  N-end-rule  pathway  (Fig.  6),  was  later  found  to  be  present  in 
all  eukaryotes,  from  fungi  to  plants  and  mammals,  and  even  in 
prokaryotes,  which  lack  ubiquitin^^.  Yet  another  degradation  sig¬ 
nal  identified  in  1986  was  the  N-terminal  ubiquitin  moiety  of  a 
fusion  protein  under  conditions  that  precluded  its  removal  by 
deubiquitinating  enzymes'*^  This  signal  is  targeted  by  a  distinct 
pathway  of  the  ubiquitin  system^^. 

A  family  of  signals,  called  N-degrons,  that  give  rise  to  the  N- 
end  rule  is  still  the  best-understood  set  of  degradation  signals.  An 
N-degron  consists  of  a  substrate's  destabilizing  N-terminal 
residue  and  an  internal  Lys  residue,  the  latter  being  the  site  of 
ubiquitin  attachment^®  The  E2-E3  ubiquitin  ligase  (Fig.  6c) 
binds  to  the  substrate's  N-terminal  residue  and  forms  a  multi- 
ubiquitin  chain  linked  to  a  substrate's  Lys  residue,  the  selection 
of  which  is  often  the  result  of  stochastic  choice  among  several 
sterically  suitable  Lys  residues'*®.  This  bi-partite  organization  is 
also  characteristic  of  subsequently  identified  degradation  signals 
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UBR1  — I CUP9  — I PTR2  dipeptide  uptake 


Fig.  7  Ubiqultin-dependent  activation  of  peptide  import  in  5.  cere- 
visiae^.  a,  Genetic  diagram  of  the  peptide  transport  circuit,  b,  UBRl  is  re¬ 
quired  for  di-peptide  uptake.  In  the  absence  of  UBRl  (ubrIA),  the 
transcriptional  repressor  CUP9  is  long-lived,  accumulates  to  high  levels 
and  extinguishes  the  expression  of  peptide  transporter  encoded  by  PTR2. 
The  ubrIA  cells  cannot  import  di-peptides  (red  dots),  c.  In  a  UBRl  cell 
growing  in  the  absence  of  extracellular  di-peptides,  UBRl  targets  CUP9 
for  degradation  (ti/2,  about  5  min),  resulting  in  a  lower  steady-state  con¬ 
centration  of  CUP9  and  weak  but  substantial  expression  of  the  PTR2 
transporter  (blue  double  ovals),  d,  In  UBRl  cells  growing  in  the  presence 
of  extracellular  di-peptides,  some  of  which  bear  destabilizing  N-terminal 
residues,  the  imported  dl-peptides  bind  to  the  basic  (type  1;  red  rectan¬ 
gle)  or  hydrophobic  (type  2;  green  wedge)  residue-binding  sites  of 
UBRl .  Binding  of  either  type  of  di-peptide  to  UBRl  allosterically  increases 
the  rate  of  UBRl -mediated  degradation  of  CUP9.  The  resulting  decrease 
of  the  half-life  of  CUP9  from  about  5  min  to  less  than  1  min  leads  to  a  fur¬ 
ther  decrease  in  CUP9  levels,  and  consequently  to  a  strong  induction  of 
the  PTR2  transporter^^. 


in  cyclins,  transcription  factors  and  other  short-lived  proteins. 
One  unique  feature  of  N-degrons  is  that  substrates  bearing  cer¬ 
tain  destabilizing  N-terminal  residues  are  chemically  modified  in 
vivo,  through  their  enz3miatic  deamidation  or  arginylation,  be¬ 
fore  a  substrate  can  be  bound  by  the  ubiquitin  ligase^®'^^  (Fig.  6). 

Having  been  the  first  ubiquitin-dependent  pathway  to  be  de¬ 
fined  through  molecular  genetic  methods,  the  N-end  rule  path¬ 
way  (Fig.  6)  was  also  the  setting  in  which  several  essential 
insights  relevant  to  the  entire  ubiquitin  system  were  first  made, 
including  the  discovery  of  specific  multi-ubiquitin  chains  and 
their  function  in  proteolysis^^  In  1985,  Hershko  and  Heller  sug¬ 
gested,  on  the  basis  of  chemical  modification  data,  that  some 
ubiquitin  moieties  in  multi-ubiquitinated  proteins  might  be 
linked  together  in  a  chain.  In  1989,  Vincent  Chau  and  colleagues 
in  my  laboratory  demonstrated  the  existence  of  protein-linked 
multi-ubiquitin  chains,  found  them  to  have  unique  topology 
(ubiquitin-ubiquitin  bonds  through  Lys48  of  ubiquitin)  and 
showed  that  these  chains  were  required  for  degradation  of  test 
proteins^^.  We  proposed  that  the  main  function  of  a  substrate- 
linked  multi-ubiquitin  chain  is  to  bind  the  substrate  to  the  pro- 
teasome^'*.  The  complexity  and  multiplicity  of  ways  in  which  a 
substrate  is  delivered  to  the  proteasome  is  demonstrated  by  the 
recent  discovery  that  ubiquitin  ligases  themselves  physically  in¬ 
teract  with  specific  subunits  of  the  26S  proteasome^^. 

Subunit  selectivity  of  protein  degradation  was  yet  another  fun¬ 
damental  feature  of  the  ubiquitin  system  that  was  first  discov¬ 
ered  in  the  N-end  rule  pathway.  Erica  Johnson  and  David  Gonda 
demonstrated  in  1990  that  this  pathway  can  eliminate  one  sub¬ 
unit  of  an  oligomeric  protein  selectively,  leaving  intact  the  other 


subunits  of  the  same  protein  molecule”.  It  is  specifically  the  sub¬ 
unit  conjugated  to  a  multi-ubiquitin  chain  that  gets  destroyed. 
Subunit  selectivity  of  proteolysis  underlies  large  differences  in 
the  in  vivo  half-lives  of  subunits  in  oligomeric  proteins.  This  es¬ 
sential  feature  of  the  ubiquitin  system  is  both  powerful  and  flex¬ 
ible,  in  that  it  allows  protein  degradation  to  be  wielded  as  an 
instrument  of  either  positive  or  negative  control.  Among  many 
examples  are  activation  of  transcription  factor  NF-kB  through 
degradation  of  its  inhibitory  ligand  IF-kB,  and  inactivation  of  cy- 
clin-dependent  kinase  activity  through  degradation  of  a  regula¬ 
tory  cyclin  subunit. 

The  emerging  functions  of  the  N-end  rule  pathway  have  been 
described”'”.  Among  these  functions,  the  best  understood  is  the 
essential  role  of  this  pathway  in  a  positive  feedback  circuit  that 
regulates  the  import  of  peptides  in  5.  cerev/szae”-”  (Fig.  7). 
Imported  peptides  bearing  destabilizing  N-terminal  residues 
bind  to  the  recognition  sites  for  N-end  rule  substrates  in  UBRl, 
the  pathway's  E3  enzyme.  This  binding  allosterically  activates 
yet  another  substrate-binding  site  of  UBRl,  leading  to  acceler¬ 
ated  degradation  of  the  transcriptional  repressor  CUP9.  The  re¬ 
sulting  derepression  of  expression  of  the  peptide  transporter 
PTR2  greatly  increases  the  cell's  capacity  to  import  peptides”. 
This  circuit  (Fig.  7)  is  the  first  example  of  small  compounds  being 
natural  allosteric  regulators  of  the  ubiquitin  system. 

A  backward  glance:  It's  Moscow,  and  the  year  is  1968.  The  au¬ 
thor,  a  chemistry  undergraduate  both  cocky  and  insecure,  is  lis¬ 
tening  to  a  leading  Russian  biochemist,  a  man  in  his  forties 
whose  education  and  entire  life  were  warped  by  the  combined 
cruelties  of  Stalinism  and  the  Lysenko-led  destruction  of  Russian 
genetics.  The  great  man  was  telling  me  something  he  considered 
self-evident:  "Ah,  Alex,  don't  waste  your  time  on  genetics.  It's  all 
ancient  Greece,  beautiful  in  a  strange  way,  but  next  to  useless. 
They  keep  tormenting  fruit  flies,  but  it's  us  biochemists  who  will 
produce  the  understanding  that  really  matters."  Having  spent  a 
day  reading  genetic  papers,  I  sensed  that  he  could  not  be  right, 
that  genetics  was  essential  too.  Over  the  next  three  decades,  the 
dynamic  interaction  of  genetics  and  biochemistry  kept  yielding 
insights  that  could  not  be  produced  by  biochemistry  or  genetics 
alone.  These  advances,  many  of  them  technical  in  nature,  have 
transformed  biology,  and  are  beginning  to  be  felt  in  medicine. 

The  early  history  of  the  ubiquitin  field  recapitulates,  in  a  mi¬ 
crocosm,  the  essential  interaction  between  biochemistry  and  ge¬ 
netics  that  underlies  the  phenomenon  of  modern  biology. 
Biochemical  studies  by  Hershko,  Ciechanover  and  their  col¬ 
leagues  revealed  a  mechanistically  unexpected,  most  curious  but 
functionally  obscure  pathway  of  protein  degradation.  Molecular 
genetic  (as  well  as  biochemical)  work  proved  necessary  for  dis¬ 
covering  the  first  physiological  functions  of  ubiquitin-depen¬ 
dent  proteolysis  and  the  first  degradation  signals  in  short-lived 
proteins.  Methods  and  approaches  developed  in  this  work,  in¬ 
cluding  the  ubiquitin  fusion  technique'^^  continue  to  be  of  use  in 
the  ubiquitin  field  and  beyond. 

The  vast  expansion  of  ubiquitin  studies  over  the  last  decade, 
with  hundreds  of  laboratories  around  the  world  working  on  the 
ubiquitin  system  and  its  legion  of  biological  functions,  is  a  sight 
to  behold.  The  fundamental  understanding  of  this  system,  and 
recent  insights  into  its  roles  in  health  and  disease  will  have  a  pro¬ 
found  influence  on  the  realm  of  therapeutic  drugs.  The  reason  is 
not  just  the  obvious  one— promising  drug  targets  among  the 
ubiquitin  system's  components  and  substrates— but  also  the  pos¬ 
sibility  of  developing  drugs  that  could  direct  this  system  to  de¬ 
stroy  (and  thereby  to  inhibit  functionally)  any  protein  target. 
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Hepatitis  C  Virus  and  eliminating  post-transfusion  hepatitis 


Harvey  J.  Alter  &  Michael  Houghton 


You'll  wonder  where  the  yellow  went:  A  30-year  perspective  on  the  near-eradication 

of  post-transfusion  hepatitis 

The  story  I  will  relate  here  traces  the  near-  this  would  be  a  lifetime  endeavor.  My  first 

total  eradication  of  transfusion-associated  HaRVEY  J.  AlTER  function  was  to  continue  and  expand  on 

hepatitis  over  the  course  of  three  decades.  prospective  studies  of  post-transfusion  he- 

I  am  perhaps  the  thread  that  links  these  events,  but  the  story  is  a  patitis  initiated  by  John  Walsh,  Bob  Purcell,  Paul  Holland  and 


fabric  woven  by  many  collaborators  who  played  essential  parts 
and  by  the  conducive  environment  of  the  National  Institutes  of 
Health  (NIH)  intramural  program  that  has  nurtured  these  clini¬ 
cal  investigations.  The  story  and  my  research  career  began  in  the 
early  1960s  when,  as  an  NIH  Clinical  Associate,  I  investigated  the 
non-cellular  causes  of  febrile  transfusion  reactions  by  screening 
multiply  transfused  patients  for  antibodies  against  serum  pro¬ 
teins  using  agar  gel  diffusion.  Ouchterlony  plates  were  piled  high 
on  my  lab  bench,  the  way  unread  manuscripts  are  today.  One 
day  Richard  Aster  told  me  that  he  had  heard  an  interesting  lec¬ 
ture  by  Baruch  Blumberg,  who  was  using  similar  methodology  to 
investigate  protein  polymorphisms.  I  visited  Blumberg  and 
began  a  collaboration  that  within  a  year  uncovered  an  unusual 
precipitin  line  resulting  from  the  reaction  between  sera  from  a 
patient  with  hemophilia  and  an  Australian  Aboriginal  person 
(Fig.l).  The  line  was  unusual  in  that  it  stained  only  faintly  with 
lipid  dyes,  in  contrast  to  the  lipoprotein  polymorphisms  that 
were  then  being  studied.  Because  it  stained  red  with  the  azo¬ 
carmine  counter-stain,  we  initially  called  this  the  red  antigen, 
then  debated  calling  it  the  Bethesda  antigen  and  ultimately 
called  it  the  Australia  antigen,  based  on  the  evolving  nomencla¬ 
ture  for  new  hemoglobin  discoveries.  Subsequent  investigations 
showed  the  prevalence  of  Australia  antigen  to  be  only  0.1%  in 
the  donor  population,  but  very  high  (10%)  in  patients  with 
leukemia.  The  first  publication  on  the  Australia  antigen^  cited 
this  association  with  leukemia  in  the  title.  We  considered  that 
the  antigen  might  be  a  component  of  the  long-postulated 
leukemia  virus.  In  retrospect,  the  antigen  merely  reflected  the 
high  transfusion  exposure  and  the  immunocompromised  status 
of  these  patients.  My  initial  first-author  publication  was  the  bio¬ 
physical  characterization  of  the  Australia  antigen^. 

In  1964,  I  left  the  NIH  to  complete  my  training  in  internal 
medicine  and  hematology,  and  Blumberg  moved  to  the  Institute 

for  Cancer  Research  in  Philadelphia,  where  Mf  : - - 

he  continued  to  pursue  the  importance  of  P  J 
the  Australia  antigen.  Both  serendipity  and  |  ^ 
good  science  led,  by  1968,  to  the  linkage  of 
this  antigen  to  viral  hepatitis,  a  link  that 
transformed  the  study  of  hepatitis,  pro-  mff 
tected  the  blood  supply,  led  to  a  hepatitis  B 
vaccine  and  culminated  in  the  Nobel  prize.  i||| 

I  returned  to  the  NIH  in  1969  to  investi- 
gate  the  causes  and  prevention  of  post- 
transfusion  hepatitis  and  to  pursue  clinical  pjg  ^  Australian 

investigations  of  hepatitis  B  and  its  associ-  and  that  of  a  multipl) 

ated  antigens.  I  had  no  premonition  that  lipid,  but  stained  red 


Paul  Schmidt.  Walsh  and  colleagues  had  already  demonstrated 
the  inordinately  high  hepatitis  risk  of  blood  transfusion  and, 
particularly,  the  risk  of  paid-donor  blood.  In  1970,  Holland, 
Schmidt  and  I,  in  a  still-memorable  meeting  that  would  later  in¬ 
fluence  national  blood  policy,  decided  that  the  continued  use  of 
paid  donors  could  not  be  tolerated  and  also  concluded  that  we 
should  introduce  donor  screening  for  what  was  by  then  called 
the  hepatitis-associated  antigen.  I  then  simultaneously  did  a  ret¬ 
rospective  analysis  that  demonstrated  the  value  of  hepatitis  B 
antigen  testing  and  initiated  a  new  prospective  study  to  assess 
the  effect  of  this  dual  change  in  the  donor  supply.  The  result  was 
substantial:  hepatitis  incidence  among  patients  undergoing 
open-heart  surgery  plummeted  from  33%  to  9.7%  (ref.  3)  (Fig.  2). 
We  calculated  that  the  main  determinant  of  this  reduction  of 
about  70%  was  the  exclusion  of  paid  donors.  Indeed,  retrospec¬ 
tive  testing  for  hepatitis  B  virus  (HBV)  markers  showed  that  only 
20%  of  the  hepatitis  found  before  antigen  screening  was  related 
to  HBV.  The  recognition  of  transfusion-associated  non-B  hepati¬ 
tis  therefore  evolved.  Improved  hepatitis  B  antigen  assays 
brought  hepatitis  B  transmission  to  near  zero  by  1977.  In  1973, 
Steve  Feinstone,  who  had  the  unenviable  task  of  sifting  through 
stool  specimens  in  the  bowels  of  NIH  building  7,  used  immune 
electron  microscopy  to  discover  the  hepatitis  A  virus,  in  collabo¬ 
ration  with  A1  Kapikian  and  Bob  PurcelT.  We  immediately 
delved  into  our  repository  of  non-B  hepatitis  cases  and  were  sur¬ 
prised  to  find  that  not  a  single  case  was  due  to  hepatitis  A  virus^ 
In  a  less-than-brilliant  foray  into  nomenclature,  we  designated 
these  cases  non-A,  non-B  (NANB)  hepatitis.  Bob  Purcell,  in  par¬ 
ticular,  felt  that  we  should  not  call  the  agent  the  hepatitis  C  virus 
until  we  had  proved  transmissibility  and  until  we  established  the 
number  of  agents  that  might  be  involved.  In  our  optimism,  we 
did  not  suspect  that  the  designation  non-A,  non-B  would  persist 
for  15  years  before  its  specific  etiology  could  be  defined. 


Flg.1  An  Australian  Aborigine  (left)  and  the  precipitin  line  formed  between  the  aboriginal  serum 
and  that  of  a  multiply  transfused  patient  with  hemophilia  (right).  The  precipitin  failed  to  stain  for 
lipid,  but  stained  red  with  the  azocarmine  counter-stain  for  protein. 
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Fig.  2  The  decreasing  incidence  of  transfusion-associated  hepatitis  in  blood  recipients  monitored 
prospectively.  Incidence,  traced  from  1969  to  1998,  demonstrates  a  decrease  in  risk  from  33%  to 
nearly  zero.  Arrows,  main  interventions  in  donor  screening  and  selection  that  effected  this  change. 


In  1975,  as  the  prospective  studies  continued,  my  attention 
was  directed  at  proving  NANB  hepatitis  transmission  in  the 
chimpanzee  model.  Earlier  attempts  at  chimpanzee  transmission 
had  failed,  but  I  reasoned  that  we  could  use  the  highly  pedigreed 
samples  from  our  prospective  studies  and  use  inocula  in  volumes 
equivalent  to  blood  transfusion  in  humans.  We  achieved  success 
in  our  first  attempt,  as  five  of  five  chimpanzees  developed  in¬ 
creases  in  alanine  aminotransferase  (ALT)  at  appropriate  inter¬ 
vals  after  inoculation^  We  were  later  able  to  use  this  same 
approach  and  accomplish  the  first  experimental  animal  trans¬ 
mission  of  human  immunodeficiency  virus^  In  the  absence  of  a 
tissue  culture  system,  an  observed  particle  or  a  serologic  assay, 
the  availability  of  an  animal  model  was  essential  to  further  study 
of  the  NANB  agent.  Throughout  these  prospective  studies,  my 
main  collaborator  was  Bob  Purcell  of  the  National  Institute  of 
Allergy  and  Infectious  Diseases,  who  provided  the  basic  research 
arm  necessary  to  propel  the  investigations  of  NANB.  From 
among  the  50  NANB  hepatitis  cases  identified  at  that  time,  I  se¬ 
lected  a  patient  (H)  who  had  a  particularly  severe  acute  NANB 
hepatitis  and  from  whom  we  had  obtained  apheresis  units  while 
his  ALT  levels  were  increasing.  Purcell  established  a  titration  se¬ 
ries  of  the  H  plasma  and  then  did  infectivity  studies  in  the 
chimp.  H  had  an  infectivity  titer  of  10^  ^  CIDso/ml  (chimp  infec¬ 
tious  dosesso)/  allowing  us  to  then  undertake  a  series  of  manipu¬ 
lations  of  the  virus  and  to  test  their  effect  on  infectivity.  Such 
studies  by  Feinstone^,  He^  and  others  showed  that  the  NANB 
agent  was  sensitive  to  chloroform  and  less  than  30  nm  in  diame¬ 
ter,  based  on  filtration.  The  agent  of  NANB  hepatitis  thus  seemed 
to  be  small,  lipid-enveloped,  blood-transmissible  and  responsible 
for  most  residual  cases  of  transfusion-associated  hepatitis. 
Subsequently,  Patrizia  Farci  from  the  University  of  Cagliari,  in 
collaboration  with  Purcell  and  I,  did  a  series  of  chimpanzee  stud- 
ies^®’“  in  which  she  mixed  chronic  phase  serum  from  patient  H 
with  the  acute-phase  infectious  inoculum  and  studied  the  infec¬ 
tivity  of  the  mixture  in  the  chimp  model.  These  studies  made  the 
important  observation  that  neutralizing  antibodies  against  he¬ 
patitis  C  virus  (HCV)  develop,  but  are  very  strain-specific  and,  in 
most  cases,  incapable  of  preventing  the  emergence  of  viral  vari¬ 
ants  that  lead  to  persistent  infection. 


As  these  virologic  and  immunologic 
studies  were  proceeding  in  the  late  1970s 
and  early  1980s,  my  main  focus  was  to  de¬ 
fine  the  clinical  consequences  of  NANB 
virus  infection  and  to  establish  an  assay 
that  might  be  amenable  to  blood  screen¬ 
ing.  The  former  proved  easier  than  the  lat¬ 
ter.  The  entity  NANB  hepatitis  initially 
met  with  considerable  skepticism,  and 
some  believed  it  caused  only  an  irrelevant 
transaminitis,  because  few  patients  had 
recognized  clinical  illness.  However,  as  we 
monitored  patients  long-term  and  did 
liver  biopsies  in  collaboration  with  Jay 
Hoofnagle  and  the  NIH  Liver  Service,  it 
became  apparent  that  most  NANB-in- 
fected  patients  had  biochemical  evidence 
of  chronic  hepatitis,  and  that  20%  pro¬ 
gressed  to  cirrhosis  over  the  course  of  one 
to  two  decades.  Later,  after  the  discovery 
of  HCV,  we  expanded  these  natural  his¬ 
tory  studies  in  both  asymptomatic 
donors^^  and  transfusion  recipients'^  the 
latter  in  collaboration  with  Leonard  Seeff,  and  confirmed  that 
20-30%  of  HCV-infected  individuals  have  severe  histologic  out¬ 
comes.  However,  equally  important,  these  studies  showed  that 
about  20%  of  HCV-infected  individuals  undergo  spontaneous  re¬ 
covery  and  that  most  have  an  indolent,  perhaps  non-progres¬ 
sive,  course.  In  collaboration  with  Farci  and  Purcell,  we  have  also 
shown  the  considerable  viral  diversity  ("quasispeciesO  of  HCV  in¬ 
fection  and  that  the  extent  of  diversity  in  the  acute  phase  of  ill¬ 
ness  predicted  whether  chronic  infection  would  ensue^^ 

The  histologic  studies  that  documented  progression  to  cirrho¬ 
sis  made  it  even  more  imperative  to  develop  a  blood-screening 
assay.  In  the  decade  from  1978  to  1988,  we  attempted  every  per¬ 
mutation  of  serologic  approaches  to  assay  development.  Despite 
using  "highly  pedigreed'  infectious  specimens,  "presumed  con¬ 
valescent'  sera,  eluted  fractions,  purified  gamma  globulins  and 
the  most-sensitive  radio-immune  assay  approaches,  we  were  un¬ 
able  to  develop  a  specific  serologic  test  for  this  elusive  agent.  In 
the  absence  of  a  specific  assay,  we  looked  for  "surrogate'  markers 
that  might  identify  NANB  carriers.  The  most  logical  approach 
was  measuring  ALT.  Although  a  retrospective  analysis  of  our 
prospective  collections  showed  that  increases  in  ALT  in  the 
donor  correlated  with  hepatitis  transmission*^  we  were  unable 
to  show  the  efficacy  of  this  "surrogate'  assay  in  a  subsequent 
prospective  study.  We  then  sought  other  measures  of  donor  in¬ 
tervention  and  reasoned  that  donors  who  had  been  exposed  to 
HBV  might  also  be  more  likely  to  have  been  exposed  to  NANB; 
such  donors  were  likely  to  have  recovered  from  HBV  infection 
and  pass  the  donor  screen,  but  might  be  persistent  carriers  of 
NANB.  Thus,  we  used  antibody  against  hepatitis  B  core  antigen 
(HBc)  as  an  index  of  past  HBV  infection,  and  showed  in  a  retro¬ 
spective  analysis  of  our  cohort  that  donors  with  antibody 
against  HBc  were  four  times  more  likely  to  transmit  NANB  he¬ 
patitis  and  that  their  exclusion  might  prevent  30%  of  such 
transmissions*^  These  data  and  those  from  a  multicenter  collab¬ 
orative  transfusion-transmitted  viruses  study*^  convinced  the 
main  blood  organizations  to  implement  testing  for  antibodies 
against  HBc  and  for  ALT  in  routine  donor  screening  in  1987.  It 
was  difficult  to  measure  the  specific  effect  of  these  "surrogate'  as¬ 
says  because  the  threat  of  transfusion-associated  AIDS  had 
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fected  patients  could  be  linked  to  infected  donors^^  Thus,  by 
1990  it  was  clear  that  HCV  was  the  principal  agent  of  NANB  he¬ 
patitis,  and  universal  donor  screening  was  initiated.  We  estab¬ 
lished  a  new  prospective  study  to  measure  the  effect  of  such 
testing  and  to  define  the  extent  of  residual  hepatitis  unrelated 
to  HBV  or  HCV.  The  first-generation  assay  for  antibody  against 
HCV  resulted  in  a  further  70%  decrease  in  hepatitis  incidence 
to  a  residual  rate  of  1.5%,  and  a  more-sensitive  second-genera¬ 
tion  assay,  introduced  in  1992,  nearly  eliminated  HCV  trans¬ 
mission  (Fig.  2).  Although  mathematical  modeling  indicates 
that  antibody-screened  blood  might  still  transmit  HCV  to 
1:100,000  to  1:200,000  recipients,  the  observed  decrease  from 
33%  in  1970  to  nearly  zero  in  1997  stands  as  a  testament  to  the 
cumulative  effectiveness  of  a  series  of  donor  screening  inter¬ 
ventions  that  were  evidence-based.  Viral  nucleic  acid  testing  of 
donors  and  improved  viral  inactivation  technologies  will  soon 
bring  transmission  of  hepatitis  and  human  immunodeficiency 
virus  from  near-zero  to  absolute  zero.  1  am  now  looking  for  an¬ 
other  line  of  work. 
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The  hepatitis  C  virus:  A  new  paradigm  for  the  identification  and  control  of  infectious  disease 

Identification  of  the  hepatitis  C  virus  immunoscreening  approach  applied  to 

The  problem  of  non-A,non-B  (NANB)  he-  MiCHAEL  HoUGHTON  recombinant  Xgtll  (ref.  24)  cDNA  li- 

patitis  emerged  in  1975,  after  serological  braries  prepared  from  total  RNA  and  DNA 

tests  for  hepatitis  A  virus  (HAV)  and  hepatitis  B  virus  (HBV)  were  extracted  from  infectious  chimpanzee  plasma^^.  Serum  from  a 
developed.  It  then  became  evident  that  most  hepatitis  cases  patient  diagnosed  with  NANB  hepatitis  was  used  as  a  presumed 
after  transfusion  were  not  due  to  either  HAV  or  HBV,  and  that  (but  unproven)  source  of  NANB-hepatitis-specific  antibodies  to 
the  risk  of  NANB  hepatitis  after  blood  transfusions  was  as  high  identify  just  one  viral  cDNA  clone  from  a  complex  cDNA  library 
as  10%  or  even  greate^^  Later,  it  also  became  evident  that  NANB  constituting  one  million  other  cDNAs.  Formal  proof  of  its  etio- 
hepatitis  occurred  frequently  in  the  form  of  sporadic,  commu-  logical  origin  came  from  the  demonstration  that  the  clone  was 
nity-acquired  infections.  A  frustrating  period  of  13  years  fol-  not  derived  from  the  host  genome,  that  it  bound  a  large  RNA 
lowed,  in  which  the  methods  successfully  used  to  identify  HAV  molecule  of  around  10,000  nucleotides  found  only  in  NANB-he- 
and  HBV  all  failed  to  result  in  the  molecular  identification  of  patitis-infectious  materials,  and  that  it  was  derived  from  a  posi- 
the  etiological  agent(s)  of  NANB  hepatitis.  No  NANB-hepatitis-  tive-stranded  RNA  encoding  a  protein  that  induced  antibodies 
specific  antigen,  antibody  or  cell  culture  system  was  identified,  only  in  NANB  hepatitis-infected  individuals^^'^^  The  RNA 
and  this  lack  of  a  molecular  'handle'  limited  progress  in  identi-  genome  also  encoded  a  large  polyprotein  of  about  3,000  amino 
tying  the  causative  agent(s)  of  NANB  hepatitis^^.  However,  a  acids  that  had  distant  primary  sequence  identity  with  members 
chimpanzee  model  successfully  developed  by  several  groups"'^®'^'  of  the  Flaviviridae  family"".  HCV  was  therefore  identified  by  di- 
was  exploited  to  show  that  one  NANB  hepatitis  agent  induced  rect  molecular  cloning  of  its  genome  in  the  relative  absence  of 
characteristic  membranous  tubules  within  the  endoplasmic  knowledge  concerning  the  nature  of  the  infectious  agent  and 
reticulum  of  infected  chimpanzee  hepatocytes"".  Known  as  the  the  immune  response.  This  'blind'  method  could  be  of  value  in 
tubule-forming  agent,  it  was  later  shown  to  be  filterable  and  to  the  future  in  unearthing  other  unknown  infectious  agents  in- 
lose  infectivity  after  treatment  with  organic  solvents,  consistent  volved  in  disease.  The  molecular  identification  of  HCV  was  the 
with  its  being  a  lipid-enveloped  virus,  possibly  a  toga-like  or  culmination  of  a  team  effort"'  spanning  7  years,  during  which 
flavi-like  virus^'"^  Other  data  supported  the  existence  of  an  HBV-  hundreds  of  millions  of  bacterial  cDNA  clones  were  screened  for 

like  NANB  hepatitis  agent^^  as  well  as  a  chloroform-resistant  a  putative  NANB  hepatitis  origin  using  many  different  ap- 
(non-enveloped)  NANB  hepatitis  virus""  and  possibly  other  proaches.  The  successful  approach  involved  Qui-Lim  Choo  in 
NANB  hepatitis  agents"".  my  Laboratory  at  Chiron  and  the  laboratories  of  George  Kuo 

Eventually,  what  turned  out  to  be  the  main  form  of  parenter-  (Chiron)  and  Daniel  Bradley  (CDC).  I  accept  the  award  on  be- 
ally  transmitted  NANB  hepatitis  was  identified  using  a  'blind'  half  of  these  collaborators  (Fig.  3). 


emerged  and  the  'surrogate'  assays  were  introduced  in  concert 
with  more-intensive  questioning  of  donors  regarding  high-risk 
behavior  and  by  a  lessened  use  of  allogeneic  blood.  Nonetheless, 
we  could  show  that  these  combined  measures  served  to  decrease 
hepatitis  incidence  to  4.5%  by  1989  (Fig.  2).  Efforts  to  develop  a 
specific  NANB  assay  continued  throughout  the  1980s,  although 
the  main  effort  by  Chiron  was  kept  well  concealed. 

During  this  time,  I  had  developed  a  panel  of  sera  consisting  of 
duplicate  coded  samples  that  had  been  proved  to  be  infectious  in 
the  chimp  or  non-infectious  in  humans.  By  1989,  many  different 
laboratories  claimed  to  have  developed  a  NANB  assay  and  asked 
to  test  the  panel.  None  was  able  to  break  the  code  and  by  1989, 
the  score  was  viruses,  20;  investigators,  zero.  At  that  time,  I  re¬ 
ceived  a  call  from  George  Kuo  at  Chiron,  saying  that  they  too  felt 
they  had  a  NANB  assay.  I  sent  George  the  remnants  of  the  now- 
dwindling  panel  and  within  days  received  their  results  followed 
by  several  anxious  calls  asking  if  I  had  yet  broken  the  code. 
When  I  did,  I  was  excited  to  find  that  Chiron  had  detected  all  but 
two  of  the  infectious  sera  and  had  properly  found  all  the  non-in- 
fectious  sera  to  be  negative.  Further,  the  two  samples  that  they 
missed  were  acute-phase  sera,  and  subsequent  samples  from 
these  same  patients  proved  to  be  positive  for  what  Chiron  now 
called  the  hepatitis  C  virus.  Michael  Houghton  will  describe  the 
events  that  preceded  this  discovery. 

Using  the  newly  developed  assay  for  antibodies  against  HCV, 
we  again  delved  into  our  repository  and  were  able  to  rapidly 
show  that  88%  of  NANB  hepatitis  cases  seroconverted  for  anti¬ 
body  against  HCV,  that  the  development  of  antibody  was  in 
temporal  relationship  to  the  course  of  hepatitis  and  that  in- 
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have  been  developed,  based  on  the  highly  con¬ 
served  5'  internal  ribosome  entry  site  and  nucle- 
ocapsid  gene  sequences,  which  can  diagnose 
new  infections  before  seroconversion  to  having 
antibodies  against  HCV  occurs.  Implementation 
of  these  tests  for  screening  of  blood  donors  will 
reduce  the  risk  of  HCV  after  transfusion  still  fur¬ 
ther  (down  to  approximately  one  in  300,000  in 
the  US,  for  example).  A  test  detecting  circulating 
nucleocapsid  antigen  is  also  of  value  in  diagnos¬ 
ing  infection  before  seroconversion. 


Properties  of  HCV 

Now  known  to  contain  a  highly  variable  RNA  genome,  HCVs 
constitute  a  large  genus  (the  hepacivirus  genus)  within  the 
Flaviviridae  family.  Six  basic  genotypes  have  been  distinguished 
so  far,  with  more  than  100  phylogenetically-distinct  subtypes^^ 
At  any  one  time,  the  viral  genome  exists  as  a  complex  quasi¬ 
species.  The  RNA  genome  contains  a  conserved  5'-terminal  inter¬ 
nal  ribosome  entry  site  that  is  responsible  for  initiating 
translation  of  the  large  polyprotein.  The  latter  is  cleaved  co-  and 
post-translationally  into  at  least  three  structural  or  virion  proteins 
and  seven  presumed  non-structural  proteins  involved  in  replica¬ 
tion  of  the  virus^^’^^^  (Fig.  4).  The  3'  terminus  of  the  RNA  genome  is 
composed  of  a  variable  region,  a  polypyrimidine  tract  and  a  highly 
conserved  stem-loop  secondary  structure^^  Hypervariable  regions 
exist  within  the  large  gpE2  glycoprotein  domain  that  may  be 
under  immune  selection^ \  The  virus  cannot  be  grown  efficiently 
in  cell  culture  or  purified  from  infected  liver  or  blood,  and  thus  still 
has  not  been  characterized  morphologically  or  biochemically. 

Serodiagnosis 

The  molecular  cloning  of  the  HCV  genome  led  to  the  availability 
of  many  recombinant  HCV  diagnostic  antigens.  George  Kuo  puri¬ 
fied  these  and  developed  numerous  experimental  EIA  tests  de¬ 
tecting  HCV  antibodies.  This  intensive  work  allowed  the  selection 
of  optimal,  immunodominant  epitopes  for  inclusion  in  an  evolv¬ 
ing  series  of  sensitive  and  specific  blood  screening  and  diagnostic 
tests  for  HCV  infection^^.  Used  to  screen  blood  donors  beginning 
in  1990,  these  assays  have  led  to  the  near-disappearance  of  trans- 
fusion-associated  hepatitis  C.  At  least  40,000  infections  have  been 
prevented  each  year  in  the  US  alone  since  the  implementation  of 
these  tests^^.  Such  tests  have  also  been  of  great  use  in  diagnosing 
hepatitis  patients  and  in  their  clinical  management,  and  in  at¬ 
tributing  liver  and  extra-hepatic  diseases  to 
HCV  infection.  Chronic  hepatitis,  liver  cirrho¬ 
sis,  hepatocellular  carcinoma,  cryoglobuline¬ 
mia  and  porphyria  cutanea  tarda  are  all 
well-established  potential  clinical  sequelae  of 
chronic  persistent  HCV  infection.  Other  dis¬ 
eases,  such  as  oral  lichen  planus,  Sjogren's-like 
s)mdrome  and  non-Hodgkin  lymphoma, 
have  also  been  linked  with  HCV  infection^®. 

Although  most  HCV-infected  individuals 
have  few  clinical  symptoms  and  will  not 
progress  to  a  severe  disease  state,  a  subset  can 
undergo  progressive  liver  disease  in  which, 
often  over  decades,  chronic  hepatitis  develops 
into  liver  cirrhosis  and  then  into  hepatocellu¬ 
lar  carcinoma.  Extra-hepatic  manifestations 
can  also  occur. 

Recently,  various  nucleic  acid  testing  assays 


Education,  new  therapies  and  vaccines 

Future  challenges  exist  for  both  developed  and  developing  coun¬ 
tries.  In  the  latter,  global  implementation  of  blood  donor  screen¬ 
ing  for  HCV  has  been  recommended  recently  by  the  World  Health 
Organization.  In  many  countries  in  which  HCV  is  endemic,  the 
risks  of  HCV  infection  after  transfusion  are  still  exceedingly  high. 
Also,  the  World  Health  Organization  has  emphasized  education  to 
lower  the  risks  of  HCV  transmission  in  both  developing  and  devel¬ 
oped  countries.  The  historical  use  of  non-sterile  injection  devices 
has  been  mainly  responsible  for  the  huge  burden  of  HCV  disease 
present  in  many  developing  countries,  as  well  as  cultural  practices 
(such  as  circumcision)  involving  the  use  of  non-sterile  medical 
equipment.  In  developed  countries,  intravenous  drug  use  involv¬ 
ing  sharing  of  needles/syringes  is  still  the  main  risk  factor^^.  Any 
procedure  involving  blood  transfer  (such  as  tattooing  using  shared 
instruments)  is  not  recommended.  The  Centers  for  Disease 
Control  also  recommends  that  HCV-infected  individuals  not  share 
toothbrushes,  razors  and  so  on. 

The  current  therapy  for  HCV  consists  of  a  combination  of  in¬ 
terferon  and  ribovirin^^  However,  both  drugs  can  produce  sub¬ 
stantial  toxicity,  and  only  a  minority  of  patients  responds.  In 
particular,  long-term  response  rates  with  the  most  common 
genotype,  type  1,  occur  in  only  approximately  30%  of  patients. 
Although  the  imminent  introduction  of  a  more  stable  form  of 
pegylated  interferon  will  improve  response  rates  somewhat,  it  is 
apparent  that  more-effective  and  less-toxic  drugs  are  required. 
The  HCV  genome  encodes  two  proteases  involved  in  processing 
of  the  viral  pol5q)rotein:  a  helicase  involved  in  unwinding  the 
RNA  strands  during  replication  and  translation,  and  a  replicase 
that  copies  the  positive  RNA  strand  (Fig.  4).  The  fine  structures  of 
all  these  enzymes  have  now  been  resolved  by  X-ray  diffraction 
methods  and  are  now  the  subjects  of  rational  drug  design^^'^®. 
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Figure  4  The  organization  of  the  hepatitis  C  positive-stranded  RNA  genome. 
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Other  therapeutic  developments  involve  ribozyme  and  antisense 
strategies  involving  the  conserved  5'  internal  ribosome  entry  site 
and  nucleocapsid  gene  sequences,  and  nucleoside  analog  in¬ 
hibitors  of  the  replicase.  A  putative  receptor  for  HCV,  the  CD81 
tetraspanin  molecule,  is  also  the  subject  of  potential  antiviral  de¬ 
velopment^^.  Therefore,  we  can  be  optimistic  that  new,  specific 
drugs  against  HCV  will  emerge  within  the  next  5-10  years. 

Between  12  and  50%  of  acute  HCV  infections  spontaneously 
resolve  without  progressing  to  the  chronically  infected  state  that 
is  associated  with  the  pathogenic  sequelae  of  infection.  Such  res¬ 
olution  of  acute  infection  is  associated  with  the  induction  of 
broad  helper  and  cytotoxic  T-lymphocyte  responses  to  the 
virus^^.  Thus,  appropriate  vaccination  to  prime  such  immune  re¬ 
sponses  may  lower  the  high  chronicity  rate  associated  with  HCV 
infection.  Vaccination  of  chimpanzees  with  recombinant  enve¬ 
lope  glycoproteins  gpEl  plus  gpE2  successfully  prevented  the  de¬ 
velopment  of  chronic  infection  in  most  animals  after  challenge 
with  homologous  or  heterologous  subtype  la  virus.  In  contrast, 
most  control  unvaccinated  animals  develop  chronic,  persistent 
infections^^.  These  data  are  encouraging  for  the  development  of 
human  vaccines.  Human  immunoglobulin  preparations  (con¬ 
taining  antibodies  against  HCV)  have  been  reported  to  be  effec- 
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live  at  reducing  the  rate  of  development  of  chronic  infection  in 
the  transfusion  setting,  between  sexual  partners  and  in  liver 
transplantation^^  Response  to  interferon  has  also  been  linked 
with  the  endogenous  level  of  intrahepatic  HCV-specific  cyto¬ 
toxic  T-lymphocyte  activity  before  treatment^®.  If  it  is  confirmed, 
appropriate  vaccination  may  also  be  important  therapy  if  such 
virus-specific  cytotoxic  T-lymphocyte  activity  can  be  boosted 
during  drug  therapy. 

Finally,  although  HCV  is  an  RNA  virus  that  does  not  produce 
DNA  replication  intermediates  that  can  integrate  into  the  host 
genome,  it  still  manages  to  persist  in  most  cases  (usually  for  life 
in  untreated  individuals).  Although  it  may  involve  the  emer¬ 
gence  of  viral  escape  mutants  to  both  antibody  and  T-cell  re¬ 
sponses,  it  is  very  likely  that  additional  mechanisms  are  in 
operation  to  result  in  such  high  rates  of  chronicity.  Elucidating 
these  mechanisms  represents  the  most  intriguing  challenge  of 
future  HCV  research  and,  in  the  process,  is  likely  to  open  up  new 
strategies  for  the  control  of  this  challenging  virus. 
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From  cell  physiology  to  cell  physiology 


I  was  born  in  Germiston,  a  small  town 
outside  Johannesburg,  South  Africa,  on  SYDNEY 

13  January  1927.  Most  of  my  early  edu¬ 
cation  was  obtained  from  the  local  Carnegie  Public  Library, 
where  1  first  learned  how  to  acquire  knowledge  from  books — 
something  that,  unfortunately,  has  almost  completely  disap¬ 
peared  from  the  modern  world.  A  bursary  from  the  local  town 
council  enabled  me  to  enter  medical  studies  at  the  University 
of  Witwatersrand  in  1942.  1  deviated  into  a  medical  science 
course  after  two  years,  when  it  was  discovered  that  I  would  be 
too  young  to  qualify  as  a  doctor  at  the  end  of  the  six-year 
course.  1  did  two  additional  years  of  medical  sciences,  and  my 
MSc  thesis  was  on  the  chromosomes  of  Elephantulus,  an  insec- 
tivore.  1  taught  myself  this  field  by  reading  Darlington's 
Recent  Advances  in  Cytology,  and  I  became  a  skilled  micro- 
scopist  and  histologist  both  from  my  work  and  from  working 
part-time  as  a  technician  in  the  Anatomy  Department. 

When  I  began  my  research,  1  already  had  several  years  of 
post-graduate  experience  in  'garage'  chemistry  and  1  had 
added  some  biochemistry  to  my  interests.  I  decided  that  cells 
were  the  important  biological  entities  to  study,  particularly 
living  cells,  and  the  subject  1  decided  to  take  up  was  one  1 
called  cell  physiology.  I  began  to  build  various  pieces  of 
equipment  that  I  thought  1  would  need.  One  was  a  Warburg 
respirometer,  but  my  glassblowing  was  not  very  inspired.  The 
other,  more  successful,  was  a  small  Beams- 
and-King  air  turbine  ultracentrifuge.  1  used 
this  to  produce  the  first  paper^  1  was  really 
proud  of  and  still  am.  In  it,  I  showed  that  the 
material  called  'chromidial  substance',  which 
stained  red  with  methyl  green  pyronin  and 
contained  RNA,  corresponded  to  the  fraction 
that  Claude  had  separated  from  ground-up 
cells  and  called  microsomes.  These  were  char¬ 
acterized  by  having  certain  sedimentation 
properties  in  extracts.  I  showed,  by  centrifug¬ 
ing  small  pieces  of  rat  liver  at  300,000^  in  my 
small  turbine,  that  the  material  identified  cy- 
tologically  had  the  same  properties  in  each 
cell  as  the  bulk  material  in  the  extract. 

I  returned  to  complete  my  medical  studies  Sydney  Brenner. 

in  1947,  but  I  was  not  an  exemplary  medical  _ 

student.  Once,  when  a  spherical  thoracic  sur¬ 
geon  uttered  the  remark  that  surgery  was  an  exact  science  like 
physics  and  chemistry,  I  burst  into  uncontrollable  hysterical 
laughter  and  was  ordered  to  leave.  I  never  returned  to  that 
part  of  the  course,  and  so  that  part  of  the  body  remained  a 
mystery.  Of  course,  as  a  skilled  human  anatomist,  I  knew  and 
still  know  what  is  in  the  thorax,  but  I  never  learned  how  to 
open  it. 

At  the  same  time  that  I  was  studying  medicine  as  well  as  re¬ 
searching  and  teaching,  I  continued  my  fanatical  collection 
of  knowledge,  nearly  all  self-taught.  When  I  read  that  solu¬ 
tions  of  DNA  were  thixotropic,  I  studied  rheology,  but  was 
disappointed  because  the  subject  seemed  to  have  more  to  do 


Sydney  Brenner 


.  with  the  food  industry  and  how  you 

RENNER  make  porridge  than  with  genes  and 

cells.  I  also  tried  to  prepare  myself  for 
the  future  by  learning  mathematics,  but  it  was  not  clear  what 
would  be  needed  and  so  I  learned  a  little  of  everything.  I  con¬ 
tinued  my  earlier  studies  in  chemistry  by  synthesizing  dyes, 
and  published  three  papers  on  supravital  staining^  L 

All  of  this  prepared  me  for  a  career  in  molecular  biology, 
but  the  subject  had  still  not  been  invented  when  I  came  to 
Oxford  in  1952  to  study  bacteriophage  in  the  physical  chem¬ 
istry  laboratory.  I  worked  with  Sir  Cyril  Hinshelwood,  who 
had  written  a  book  called  The  Physical  Chemistry  of  the 
Bacterial  Cell,  which  I  thought  was  close  enough  to  the  yet- 
unborn  subject.  I  had  my  own  ideas  about  how  DNA  might  be 
related  to  proteins  (these  were  wrong),  and  I  thought  we 
could  study  the  structure  of  DNA  by  optical  methods  with 
acridine  dyes,  a  topic  later  revisited  in  a  different  form^-^.  Jack 
Dunitz  and  Leslie  Orgel,  whom  I  met  at  Oxford,  eventually 
led  me  to  the  Cavendish  Laboratory  in  Cambridge.  There,  in 
April  1953,  I  met  Francis  Crick  and  James  Watson  and  the 
DNA  model.  I  knew  then  and  there  exactly  what  I  was  going 
to  do  for  the  rest  of  my  life. 

I  have  written  elsewhere  about  the  exhilaration  of  belong¬ 
ing,  in  those  early  days  of  molecular  biology,  to  a  radical 
evangelical  secC.  It  was  composed  mostly  of  physicists,  and  I 
was  the  only  failed  thoracic  surgeon  among 
them.  Like  most  of  my  colleagues  I  read 
Schroedinger's  What  is  Life,  but  in  1946,  I 
didn't  understand  it.  After  reading  von 
5  Neumann's  work  on  self-reproducing  ma- 
I  chines  in  1952,  I  knew  where  Schroedinger 
I  had  gone  wrong.  Schroedinger  thought  that 
I  the  genetic  material  contained  both  the  pro- 
f  gramme  for  development  as  well  as  the 
i  means  for  its  execution,  whereas  von 
I  Neumann  showed  that  it  contains  only  a  de- 
I  scription  of  the  means  for  executing  the  pro- 
I  gramme  (see  ref.  8). 

I  For  the  last  50  years  or  so,  all  my  scientific 
I  work  has  been  directed  toward  determining 
how  the  genetic  components  in  biological 
_  systems  are  implemented  to  generate  organ¬ 
isms  with  complex  structures  and  functions. 
This  remains  the  central  problem  of  biology,  and  is  the  same 
whether  one  studies  bacteriophage  or  bacteria,  or  nematodes, 
fish  or  human  beings.  It  is  the  same  whether  one  does  classi¬ 
cal  experimental  genetics  or  one  sequences  genomes.  Today, 
when  we  have  organized  large  groups  to  sequence  the  human 
genome,  and  high  throughput  is  on  everyone's  lips,  and 
when  we  are  puzzled  by  how  we  are  going  to  integrate  all  of 
this  information,  I  return  to  the  thought  I  had  quite  early  that 
genes  are  not  the  units  of  development  or  function.  Cells  are 
the  units,  and  our  job  is  to  discover  how  the  products  of  the 
genes  work  in  cells  to  govern  their  activities. 

I  wrote  the  following  as  an  introduction  to  a  Ciba 
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Symposium  in  1978  (ref.  9):  "The  great  ease  with  which  mol¬ 
ecular  information  can  be  collected  on  the  genomes  of 
higher  organisms  will  tempt  many.  We  can  inevitably  expect 
vast  compendia  of  sequences  but,  without  functional  refer¬ 
ence,  these  compendia  will  be  uninterpretable,  like  an  unde¬ 
ciphered  ancient  language.  Many  people  and  many 
computers  will  play  games  with  these  sequences,  but  we  will 
have  to  find  out  by  experiment  what  the  sequences  do  and 
how  the  products  they  make  participate  in  the  physiology 
and  development  of  the  organism.  Thus,  although  the  analy¬ 
sis  of  the  genotype  has  been  taken  care  of,  we  still  need  better 
ways  of  analyzing  phenotypes.  Many  of  us  are  ultimately  in¬ 
terested  in  the  causal  analysis  of  development  and  the  reduc¬ 
tion  of  the  complex  phenotypes  of  higher  organisms  to  the 
level  of  gene  products.  This  is  still  the  major  problem  of  biol¬ 
ogy.  We  must  understand  what  cells  can  do  because  all  of 
what  we  are  is  generated  by  cells  growing,  moving  and  differ¬ 
entiating." 

This  is  still  valid  today. 
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osmium  tetroxide  in  0.05  M  cacodylate  buffer  pH  7,0  at  4‘’C  for  2h,  rinsed  in  0.025  M 
sodium  phosphate  (pH  7.0),  dehydrated  in  a  graded  ethanol  series  at  4°C  and  critical- 
point-dried  in  a  Polaron  E3000  Critical  Point  Drier  using  CO>.  Seedlings  were  mounted  on 
stubs,  and  organs  obscuring  the  apical  meristem  were  removed.  Specimens  were  coated 
with  20  nm  of  gold-palladium  in  a  Polaron  E5000  Sputter  Coater,  and  examined  on  a 
Philips  XL30FEG  scanning  elearon  microscope  at  5kV. 

Statistical  analysis 

Statistical  tests  were  pjcrformed  using  GraphPad  Instat  Version  3.0  (GraphPad  Software). 
Received  12  November  1 999;  accepted  23  March  2000. 
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Protein  degradation  by  the  ubiquitin  system  controls  the  intra¬ 
cellular  concentrations  of  many  regulatory  proteins.  A  protein 
substrate  of  the  ubiquitin  system  is  conjugated  to  ubiquitin 
through  the  action  of  three  enzymes,  El,  E2  and  E3,  with  the 
degradation  signal  (degron)  of  Ae  substrate  recognized  by  E3 
(refs  1”3).  The  resulting  multi-ubiquitylated  substrate  is 
degraded  by  Ae  26S  proteasome^.  Here  we  describe  Ae  physiolo¬ 
gical  regulation  of  a  ubiquitin-dependent  paAway  Arou^  allo¬ 
steric  modulation  of  its  E3  activity  by  small  compounds.  Ubr  1,  Ae 
E3  enzyme  of  Ae  N-end  rule  pathway  (a  ubiquitin-dependent 
proteolytic  system)  m  Saccharomyces  cerevisiae  mediates  Ae 
degradation  of  Cup9,  a  transcriptional  repressor  of  Ae  peptide 
transporter  Ptr2  (ref.  5).  Ubrl  also  targets  proteins  Aat  have 
destabilizing  amino-terminal  residues*.  We  show  Aat  Ae 
degradation  of  Cup9  is  allosterically  activated  by  dipeptides 
wiA  destabilizing  N-terminal  residues.  In  the  resulting  positive 
feedback  circuit,  imported  dipeptides  bind  to  Ubrl  and  accelerate 
Ae  Ubrl -dependent  degradation  of  Cup9,  Aereby  de-repressing 
Ae  expression  of  Ptr2  and  mcreasing  Ae  cell*s  capacity  to  import 
peptides.  These  findings  identify  the  physiological  rationale  for 
Ae  targeting  of  Cup9  by  Ubrl ,  and  indicate  Aat  small  compounds 
may  regulate  oAer  ubiquitin-dependent  paAways. 

The  rate  of  degradation  of  specific  proteins  is  often  regulated  by 
modulating  Ae  exposure  or  Ae  structure  of  Aeir  degrons.  For 
example,  Ae  degrons  of  Ae  cyclin-dependent  kinase  inhibitors  Sicl 
and  p27  are  activated  by  phosphorylation,  which  is  timed  to  bring 
about  their  destruction  at  key  transition  points  in  Ae  cell  cycled  In 
other  cases,  phosphorylation  regulates  Ae  activity  of  an  E3  itself. 
For  example,  Ae  anaphase-promoting  complex,  a  multisubunit  E3, 
is  activated  only  at  mitosis®.  One  ubiquitin-dependent  proteolytic 
system,  caUed  the  N-end  rule  pathway,  targets  proteins  carrying  a 
degradation  signal  called  the  N-degron*^,  which  comprises  a  desta¬ 
bilizing  N-terminal  residue  and  a  lysine  residue^”'”.  In  5.  cerevisiae, 
there  are  two  classes  of  destabilizmg  residues,  basic  or  ‘type  V  (Arg, 
Lys  and  His),  and  bulky  hydrophobic  or  ‘type  2’  (Phe,  Leu,  Tyr,  Trp 
and  He).  Ubrl,  a  R1NG-H2  finger- containing  E3  of  relative 
molecular  mass  225,000  (Mr  225K),  directly  recognizes  Aese 
N-terminal  residues**'".  A  complex  of  Ubrl  and  the  E2  enzyme 
Rad6  (Ubc2)  synAesizes  a  multi- ubiquitin  chain  linked  to  a  lysine 
residue  of  Ae  substrate' ^  Dipeptides  with  a  destabilizing  N-termmal 
residue  of  either  basic  or  hydrophobic  type  act  as  competitive 
inhibitors  of  Ae  degradation  of  N-end  rule  substrates  carrying 
the  same  type  of  destabilizing  residue'"'”'*.  Thus,  Ubrl  contains  two 
distinct  N-termmal  residue-binding  sites  that  are  each  capable  of 
binding  either  a  dipeptide  or  a  protein,  but  not  boA  at  the  same 
time. 

The  N-end  rule  pathway  was  discovered  Arough  Ae  use  of 
engineered  reporter  proteins^.  The  first  physiological  function  of 
Ubrl  m  S.  cerevisiae  has  recently  been  shown  to  be  Ae  regulation  of 
peptide  uptake'",  through  control  of  degradation  of  the  Mr  35K 
homeodomam  protem  Cup9,  a  transcriptional  repressor  of  the  di- 
and  tripeptide  transporter  Ptr2  (ref.  5).  Ubrl  targets  Cup9  through 
a  degron  located  in  Ae  carboxy-termmal  half  of  Cup9  (F.Navarro- 
Garcia,  G.C.T.  and  A.V.,  unpublished  data).  Despite  Ais  unexpected 
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mode  of  recognition,  we  asked  whether  dipeptides  bearing 
destabilizing  N-terminal  residues  could  affect  the  Ubrl -mediated 
degradation  of  Cup9,  as  dipeptides  are  able  to  inhibit  degradation  of 
canonical  N-end  rule  substrates 

We  addressed  this  question  using  Cup9nsf>  a  C-terminally  Flag- 
tagged  variant  of  the  Cup9  repressor  bearing  an  Asn265— ►Ser 
substitution.  Cup9nsf  degraded  indistinguishably  from  wild- 
type  Cup9  but  was  predicted  to  have  a  much  lower  affinity  for 
so  it  would  not  influence  the  expression  of  Ptr2  and  the 
uptake  of  dipeptides.  Cup9nsp  was  expressed  as  part  of  a  fusion  of 
the  form  Flag-DHFR-ubiquitin-Cup9NSF  (where  Flag-DHFR  is 
N-terminally  Flag-tagged  mouse  dihydrofolate  reductase;  see 
Methods;  Fig.  1).  Ubiquitin-specific  proteases  co-translationally 
cleave  this  UPR  (ubiquitin-protein-reference)  fusion  at  the 
ubiquitin~Cup9  junction,  yieling  the  test  protein  Cup9nsf  and 
the  long-lived  Flag-DHFR-ubiquitin  reference  protein,  which 
serves  as  an  internal  control  for  variations  in  expression  levels  and 
immunopr ecipitation  efficiency^®’^  ‘ . 

Cells  expressing  Cup9nsf  were  grown  in  minimal  medium  con¬ 
taining  allantoin  as  the  nitrogen  source  to  avoid  the  known  effects  of 
nitrogen  cataboUte  repression  on  PTR2  expression^’.  Leu-Ala  and 
Arg-Ala,  dipeptides  bearing  either  type  of  destabilizing  N-terminal 
residue  (Leu,  bulky  hydrophobic;  Arg,  basic)  were  added  to  a  final 
concentration  of  10  mM  (see  Methods).  This  dipeptide  concentra¬ 
tion  results  in  maximal  inhibition  of  degradation  of  N-end  rule 
substrates’^.  Notably,  the  addition  of  either  Leu- Ala  or  Arg-Ala 
exerted  an  opposite  effect  on  Cup9ksf,  strongly  accelerating  its 
degradation  in  wild-type  (UBRl)  cells.  The  half-life  of  Cup9nsf 
decreased  from  ~5  min  in  the  absence  of  dipeptides  (Fig.  Ic)  to  less 
than  1  min  in  their  presence  (Fig.  lb).  This  stimulatory  effect  was 
not  observed  in  a  ubrl  A  strain,  indicating  that  the  augmented 
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Figure  1  Enhancement  of  Cup9  degradation  by  dipeptides  with  destabilizing  N-terminal 
residues,  a,  The  fusion  protein  used  for  pulse-chase  analysis.  The  stable  Flag-DHFR- 
ubiquitin  reference  portion  of  the  fusion  is  co-translationally  cleaved  from  Cup9nsf  by 
ubiquitin-specific  protease  (UBP).  b,  Pulse-chase  analysis  of  Flag-DHFR-ubiquitin - 
Cup9nsf  in  the  presence  of  various  dipeptides  at  a  concentration  of  10  mM.  Dipeptides 
with  either  basic  (Arg-Ala)  or  bulky  hydrophobic  (Leu-Ala)  destabilizing  N-terminal 
residues  strongly  enhance  Cup9j^F  degradation,  but  only  in  strains  expressing  Ubrl . 
Dipeptides  bearing  a  stabilizing  N-terminal  residue  (Ala-Arg  and  Ala-Leu)  do  not  alter 
Cup9nsf  degradation,  c,  Effects  of  different  concentrations  of  Trp-Ala  on  the 
enhancement  of  CupOnsf  degradation  in  wild-type  {UBR1)  cells.  Dash  indicates  that  pulse- 
chase  analysis  was  performed  in  the  absence  of  added  dipeptides.  Enhancement  of 
Cup9nsf  degradation  was  detectable  at  1  p,M  Trp-Ala  Ub’’^®.  ubiquitin  containing  mutation 
of  Lys48toArg  (ref.  20). 


degradation  of  Cup9nsf  was  dependent  on  Ubrl.  The  enhancement 
of  degradation  required  dipeptides  with  destabilizing  N-terminal 
residues;  dipeptides  of  the  same  composition  but  bearing  a  stabihz- 
ing  residue  (Ala-Leu  and  Ala-Arg)  did  not  affect  the  degradation  of 
Cup9nsf  (^1/2  min)  (Fig.  lb,  and  data  not  shown).  Similar  results 
were  obtained  with  cells  expressing  Cup9ksf  that  was  not  a  part  of  a 
UPR  fusion  (data  not  shown). 

To  determine  the  concentration  dependence  of  the  stimulation, 
we  measured  the  degradation  of  Cup9  at  a  range  of  concentrations 
of  Trp-Ala.  The  enhancement  of  Cup9NSF  degradation  was  detect¬ 
able  at  1  jJiM  Trp-Ala,  the  lowest  concentration  tested  (fi/2  *^1  min) 
(Fig.  Ic).  In  contrast,  the  degradation  of  Cup9nsf  was  not  altered  by 
either  Ala-Trp  or  the  constituent  amino  acids  Trp  and  Ala  (Fig.  Ic). 
Similar  results  were  obtained  using  Leu- Ala  and  Arg— Ala  (data  not 
shown).  These  results  indicate  that  the  relevant  signalling  molecule 
in  this  process  is  a  dipeptide  bearing  a  destabilizing  N-terminal 
residue.  The  range  of  dipeptide  concentrations  that  significantly 
stimulated  Cup9  degradation  was  similar  to  physiologically  active 
levels  of  many  other  nutrients. 

Cup9  represses  transcription  of  the  transporter-encoding  PTR2 
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Figure  2  Effects  of  dipeptides  on  expression  of  the  dipeptide  transporter  gene  PTR2. 
a,  Induction  of  P7R?  expression  by  dipeptides  bearing  destabilizing  N-terminal  residues 
{Trp-Ala  and  Arg-Ala)  required  both  UBR1  and  CUPS.  Dipeptides  bearing  a  stabilizing  N- 
terminal  residue  (Ala-Trp  and  Ala-Arg)  had  no  effect  on  P77?2  expression.  P77?2mRNA 
and  the  ACT1  mRNA  loading  control  are  indicated,  b,  Effect  of  different  concentrations  of 
Trp-Ala  on  the  levels  of  PTR2  mRNA. 
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Figure  3  In  wfroubiqultylation  of  Cup9  is  enhanced  by  dipeptides  bearing  destabilizing  N- 
terminal  residues.  Reactions  consisting  of  purified  Ubal  {E1).  Rad6  (E2),  Ubrl  (E3), 
ubiquitin,  ATP  and  radiolabelied  Cup9  were  supplemented  with  the  indicated  dipeptides 
or  amino  acids  (top  panel,  2  \lU,  bottom  panel,  10  piM),  or  left  unsupplemented  (-),  and 
allowed  to  proceed  at  30  °C  for  the  designated  times.  Radiolabelied  input  Cup9,  and  its 
multi- ubiquitylated  derivatives  are  indicated.  Ub,  ubiquitin. 
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gene^.  Thus,  the  dipeptide-induced  acceleration  of  Cup9  degrada¬ 
tion  would  be  expected  to  increase  the  levels  of  PTR2  messenger 
RNA,  ultimately  leading  to  an  increase  in  dipeptide  uptake.  We 
tested  this  by  examining  the  levels  of  PTR2  mRNA  in  the  presence  or 
absence  of  dipeptides  in  the  medium.  At  25  |xM,  both  Trp-Ala  and 
Arg-Ala  induced  PTR2  expression  in  the  wild- type  {UBRl)  strain 
(Fig.  2a).  Both  Ubrl  and  Cup9  were  required  for  these  effects,  as  the 
expression  of  PTR2  was  not  altered  by  dipeptides  in  ubrlA  and 
cup9A  strains.  Induction  of  PTR2  mRNA  could  be  observed  at  1  |xM 
Trp-Ala,  and  increased  at  higher  dipeptide  concentrations  (Fig.  2b), 
in  agreement  with  the  observed  changes  in  the  half-life  of  Cup9 
(Fig.  Ic). 

A  plausible  mechanism  of  the  enhancement  effect  is  that  a 
dipeptide  interacts  with  either  the  basic  or  the  hydrophobic  N- 
terminal  residue-binding  sites  of  Ubrl,  and  a  distinct  (third) 
substrate-binding  site  of  Ubrl  recognizes  the  internal  degron  of 
Cup9.  In  this  model,  the  interaction  of  Ubrl  with  dipeptides 
aUosterically  increases  the  ability  of  the  Ubrl-Rad6  (E3-E2)  com¬ 
plex  to  ubiquitylate  Cup9.  To  test  whether  dipeptides  act  directly 
through  Ubrl,  we  examined  the  effect  of  dipeptides  on  Cup9 
ubiquitylation  in  an  in  vitro  system  consisting  of  the  following 
purified  components:  Ubrl  (E3),  Rad6  (E2),  Ubal  (El),  ubiquitin, 
ATP  and  radiolabelled  Cup9.  Cup9  was  significantly  multi-ubiqui- 
tylated,  in  a  Ubrl/Rad6-dependent  reaction  (data  not  shown),  in 


the  absence  of  added  dipeptides  (Fig.  3).  This  result  was  consistent 
with  the  relatively  rapid  in  vivo  degradation  of  Cup9p  (ti/2  '^5  min) 
in  the  absence  of  dipeptides  (Fig.  Ic). 

The  addition  of  dipeptides  bearing  either  type  of  destabilizing  N- 
terminal  residue  to  the  in  vitro  system  substantially  stimulated  the 
Ubrl -dependent  multi-ubiquitylation  of  Cup9  (Fig.  3).  Dipeptides 
of  the  same  composition  but  bearing  a  stabilizing  N-terminal 
residue  did  not  stimulate  multi-ubiquitylation,  nor  did  the 
amino-acid  components  of  these  dipeptides  (Fig.  3).  These  results 
demonstrate  that  dipeptides  act  directly  through  Ubrl,  without  an 
intermediate  signalling  pathway.  The  underlying  allosteric  mechan¬ 
ism  may  involve  increased  affinity  of  Ubrl  for  Cup9,  or  enhanced 
ubiquitylation  activity  of  the  Ubrl-Rad6  complex  towards  Cup9, 
or  both. 

This  work  shows  that  the  two  binding  sites  of  Ubrl  that  interact 
with  destabilizing  N-terminal  residues  can  act  as  allosteric  sites  that 
enable  Ubrl  to  sense  the  presence  of  imported  dipeptides  and  to 
accelerate  degradation  of  the  Cup9  repressor,  resulting  in  an 
appropriate  induction  of  the  Ptr2  transporter  (Fig.  4a-d).  This 
model  predicts  that  a  dipeptide  bearing  a  destabilizing  N-terminal 
residue,  for  example  Leu-Ala,  should  stimulate  its  own  uptake,  in 
contrast  to  Ala-Leu.  This  prediction  was  borne  out  when  we  tested 
the  ability  of  these  two  leucine-containing  dipeptides  to  support  the 
growth  of  S.  cerevisiae  auxotrophic  for  leucine  (Fig.  4e). 
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Figure  4  Feedback  regulation  of  peptide  import  in  S.  cerevisiae.  a,  The  peptide  transport 
circuit,  b,  Ubrl  is  required  for  dipeptide  uptake.  In  the  absence  of  Ubr1  {ubr1A),  the 
transcriptional  repressor  Cup9  is  long-lived,  accumulates  to  high  levels  and  extinguishes 
expression  of  the  P77?2gene.  Thus,  ubr1A  celis  cannot  import  dipeptides  (red  dots),  c,  In 
a  wild-type  [UBRI)  cell  growing  in  the  absence  of  extracellular  dipeptides,  Ubr1  targets 
Cup9  for  degradation  {fv2  min),  resulting  in  a  moderate  concentration  of  Cup9  and 
weak  but  significant  expression  of  the  Ptr2  transporter  (blue  double  ovals),  d,  In  wild-type 
{UBRf}  cells  growing  in  the  presence  of  extracellular  dipeptides,  some  of  which  bear 
destabilizing  N-terminal  residues,  the  imported  dipeptides  bind  to  either  the  basic  (type  1 ; 
red)  or  the  hydrophobic  (type  2;  green)  residue-binding  site  of  Ubrl .  Binding  of  either  type 


of  dipeptide  to  Ubr1  aUosterically  increases  the  rate  of  Ubrl -mediated  degradation  of 
Cup9.  Peptides  of  both  types  are  shown  bound  to  Ubr1 ,  but  the  binding  of  either  peptide 
accelerates  Cup9  degradation.  The  resulting  decrease  of  the  half-life  of  Cup9  from 
~5min  to  <1  min  results  in  a  very  low  concentration  of  Cup9,  and  consequently  to 
strong  induction  of  the  Ptr2  transporter,  e,  Colony  formation  assays.  An  S.  cerevisiae 
strain  requiring  leucine  for  growth  was  incubated  on  plates  supplemented  with  either 
dipeptides  or  their  amino-acid  constituents  at  the  following  concentrations:  Leu-Ala, 
230  |ulM  (2  puM  where  indicated):  Ala-Leu,  230  piM;  Arg-Ala,  10  fjiM:  Ala-Arg,  10  |xM; 
Lys-Ala,  1  puM;  Ala-Lys,  1  piM;  Arg  &  Ala,  10  piMeach,  Lys  &  Ala,  1  puM  each. 
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Food  sources  that  S.  cerevisiae  encounters  outside  the  laboratory 
setting  are  likely  to  contain  mixtures  of  short  peptides,  a  subset  of 
which  would  be  capable  of  activating  the  Ubrl -based  positive 
feedback  circuit.  We  modelled  this  situation  by  providing  cells 
with  a  mixture  of  Leu— Ala  at  a  low  concentration  (2  p.M)  and 
Ala-Leu  at  a  high  concentration  (230  p-M).  Although  neither 
dipeptide  supplement  alone  could  satisfy  the  strain’s  requirement 
for  leucine,  a  mixture  of  the  two  dipeptides  supported  robust 
growth  (Fig.  4e).  Moreover,  a  mixture  of  Ala-Leu  (230  fJiM)  and 
a  dipeptide  lacking  leucine  but  bearing  a  destabilizing  N-terminal 
residue  (10  |xM  Arg-Ala  or  1  piM  Lys-Ala)  also  rescued  growth.  In 
contrast,  dipeptides  bearing  stabilizing  N-terminal  residues,  or  the 
amino-acid  components  of  these  dipeptides,  could  not  rescue  the 
growth  of  Leu"  cells  in  the  presence  of  230  fxM  Ala-Leu  (Fig.  4e). 

This  work  establishes  for  the  first  time  that  the  activity  of  an  E3 
can  be  directly  linked  to  the  presence  of  an  environmental  signal 
through  an  allosteric  interaction  with  a  small  compound.  Specifi¬ 
cally,  dipeptides  bearing  destabilizing  N-terminal  residues  are 
shown  to  act  as  allosteric  activators  of  Ubrl,  enhancing  its  ability 
to  support  the  ubiquitylation  and  degradation  of  Cup9.  Physiolo¬ 
gically,  this  results  in  a  positive  feedback  circuit  governing  the 
uptake  of  peptides  (Fig,  4).  Through  their  binding  to  Ubrl,  the 
imported  dipeptides  accelerate  degradation  of  Cup9,  thereby  dere¬ 
pressing  the  synthesis  of  the  Ptr2  transporter  and  enhancing  the 
cell’s  ability  to  import  di-  and  tripeptides.  As  most  cells  have  the 
capacity  to  import  peptides,  the  results  of  this  work  suggest  that 
peptide  import  maybe  regulated  similarly  by  Ubrl  homologues  in 
metazoans^^  and  by  the  ClpAP-dependent  N-end  rule  pathway  in 
Escherichia  coli^^.  The  ubiquitin  system  is  either  known  or  suspected 
to  be  important  in  the  control  of  intermediary  metabolism  and  the 
transport  of  small  molecules  across  membranes*’^.  Our  findings 
suggest  that  these  compounds,  or  their  enzymatically  produced 
derivatives,  may  modulate  the  functions  of  E3s  in  the  ubiquitin 
system  similarly  to  the  effects  observed  here  with  dipeptides  and 
Ubrl.  □ 

Methods 

Yeast  strains  and  plasmids 

The  S.  cerevisiae  strains  used  in  pulsc-chase  experiments,  JD52  {MATa  lys2-801  i{ra3-52 
trpl~A63  his3-A200  Ieu2-3J12),  JD55  ( whrJA::HIS3),  have  been  described'^  Flag- 
DHFR-ubiquitin-Cup9Nsp  was  expressed  from  the  PMtT25  promoter  on  the  centromeric 
vector  p416MET25  (ref.  26).  Construction  details  arc  available  upon  request.  .Strains  used 
for  northern  analyses  were  AVY30  {MATa  leu2-3,112  MhriA;;LE t/2),  AVY  31  (MATa  leu2~ 
3,112  cup9A::LEU2)  and  AVY32  {MATa  LEU2),  constructed  in  the  RJD350  background 
(MATa  leii2-3,112\  a  gift  from  R.  Deshaies)  using  restriction  fragments  obtained  from 
plasmids  pSOB30  (ref.  6),  pCBI19  (ref.  5)  and  plJ252  (ref.  27),  respectively.  The  RJD350 
strain  was  used  for  the  colony  formation  assay  (Fig.  4e).  SHM  plates^  were  supplemented 
with  dip>cptides  or  amino  acids  at  the  following  concentrations:  230  p.M  Leu-Ala  (or 
2  p.M  where  indicated),  230  p,M  Ala- Leu,  230  p,M  each  of  Leu  and  Ala,  10  |jt.M  Arg-Ala, 
10  p,M  Ala-Arg,  1  p-M  Lys-Ala,  1  p-M  Ala-Lys,  10  p,M  each  of  Arg  and  Ala,  or  1  p.M  each 
of  Lys  and  Ala. 

Pulse-chase  analysis 

Cells  were  cultured  in  SHM'  with  auxotrophic  supplements.  Dipeptides  were  added  to 
cultures  at  an  absorbance  of  ~0.6  at  600nm  and  incubation  continued  for  2.5  h  in  the 
experiments  in  Fig.  lb,  and  for  30  min  for  Fig.  Ic.  Cells  were  harvested,  washed  in  0.8  ml  of 
SHM,  resuspended  in  0.4  ml  of  SHM,  and  labelled  for  5  min  at  30  “C  with  0.16  mCi  of '^'^S- 
EXPRESS  (New  England  Nuclear).  Cells  were  pelleted  and  resuspended  in  fresh  SHM 
containing  4  mM  L-methionine  and  2mM  i-cysteine.  Samples  (0.1ml)  were  taken  at  the 
time  points  indicated  and  transferred  to  chilled  tubes,  each  containing  0.5  ml  of  0.5-mm 
glass  beads,  0.7  ml  of  ice-cold  lysis  buffer  (1%  Triton-XlOO,  0.15  M  NaCI,  5mM  EDTA, 
50  mM  sodium  HEPES,  pH  7.5),  and  a  mixture  of  protease  inhibitors  (final  concentra¬ 
tions  ImM  phenylmethylsulphonyl  fluoride,  2p,gmr*  aprotinin,  0.5p,gmr‘  leupeptin 
and  0.7  p.g  ml"'  pepstatin).  Extracts  were  prepared  and  immu noprecipitations  carried 
out  as  described'%  using  anti-Flag  M2  resin  (Sigma).  Immunoprecipitates  were  fractio¬ 
nated  by  13%  SDS-poIyacrylamide  gel  electrophoresis  (PAGE),  and  detected  by  auto¬ 
radiography. 

RNA  preparation  and  northern  analysis 

Cells  were  cultured  in  SHM  to  an  absorbance  of  ~0.6  at  600  nm.  The  indicated  dipeptides 
were  then  added  to  a  25  p.M  final  concentration,  and  the  incubation  was  continued  for  an 


additional  30  min.  Total  RNA  was  prepared"*  and  25  p,g  samples  were  elect rophoresed  in 
1%  formaldehyde-agarose  gels,  followed  by  blotting  for  northern  analysis^. 

Ubrl -dependent  in  vhro  ubiquitylation  system 

The  components  of  this  system  were  purified  as  follows.  N-terminally  hexahistidine- 
tagged  Ubal  was  overexpressed  in  S.  cerevisiae  and  purified  by  fractionation  over  Ni-NTA, 
ubiquitin  affinity  and  Superdex-200  columns.  Rad6  was  overexpressed  in  E.  coli  and 
purified  by  fractionation  over  DEAE,  Mono-Q  and  Superdex-75  columns.  N-terminally 
Flag-tagged  Ubrl  was  overexpressed  in  S.  cerevisiae,  and  purified  by  fractionation  over 
anti-Flag  M2,  Ubc2p  affinity  and  Superdex-200  columns.  N-terminally  Flag-tagged,  C- 
terminally  hexahist  id  in  e- tagged  Cup9  was  expressed  and  radiolabellcd  in  £.  coli,  and 
purified  by  fractionation  over  Ni-NTA  and  anti-Flag  M2  columns.  Details  of  these 
expression  and  purification  protocols  arc  available  upon  request. 

The  ift  vitro  ubiquitylation  reactions  contained  the  following  components:  7  jiM 
ubiquitin,  50 nM  Ubal,  50 nM  Rad6,  50 nM  Ubrl,  550 nM  ^^S-labelled  Cup9,  25 mM 
HEPES/KOH  (pH 7.5),  25 niM  KCl,  5 mM  MgCi,,  2  mM  ATP,  0.1  mM  dithiothreitol  and 
0.5  mgmr'  ovalbumin  as  a  carrier  protein.  Dipeptides  or  amino  acids,  as  indicated,  were 
added  to  a  final  concentration  of  2  p-M  (top  panel.  Fig.  3)  or  10  p,M  (bottom  panel,  Fig.  3). 
All  components  except  Ubal  were  mixed  on  ice  for  10 min;  Ubal  was  then  added  and 
reactions  shifted  to  30  “C.  After  the  indicated  times,  the  reactions  were  terminated  by 
adding  an  equal  volume  of  two  times  SDS-PAGE  loading  buffer  and  heating  at  95  for 
5  min,  followed  by  8%  SDS-PAGE. 
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The  ubiquitin  (Ub)  system  recognizes  degradation  signals  of  the 
target  proteins  through  the  E3  components  of  E3-E2  Ub  ligases.  A 
targeted  substrate  bears  a  covalently  linked  multi-Ub  chain  and  is 
degraded  by  the  ATP-dependent  26S  proteasome,  which  consists 
of  the  20S  core  protease  and  two  19S  particles.  The  latter  mediate 
the  binding  and  unfolding  of  a  substrate  protein  before  its  transfer 
to  the  interior  of  the  20S  core.  It  is  unclear  how  a  targeted  substrate 
is  delivered  to  the  26S  proteasome,  inasmuch  as  RpnIOp,  the  only 
known  proteasomal  subunit  that  binds  multi-Ub  chains,  has  been 
found  to  be  not  essential  for  degradation  of  many  proteins  in  the 
yeast  Saccharomyces  cerevisiae.  Here  we  show  that  Ubr1p  and 
Ufd4p,  the  E3  components  of  two  distinct  Ub  ligases,  directly 
interact  with  the  26S  proteasome.  Specifically,  Ubr1p  is  shown  to 
bind  to  the  Rpn2p,  Rptip,  and  Rpt6p  proteins  of  the  19S  particle, 
and  Ufd4p  is  shown  to  bind  to  RptSp.  These  and  related  results 
suggest  that  a  substrate-bound  Ub  ligase  participates  in  the  de¬ 
livery  of  substrates  to  the  proteasome,  because  of  affinity  between 
the  iigase's  E3  component  and  specific  proteins  of  the  19S  particle. 

E3  1  N-end  rule  |  ubiquitin  fusion  degradation  pathway  |  Ubrip  |  Ufd4p 

Regulated  proteolysis  by  the  ubiquitin  (Ub)  system  plays 
essential  roles  in  the  ceil  cycle,  differentiation,  stress  re¬ 
sponses,  and  many  other  processes  (1-5).  Ub  is  a  76-residue 
protein  whose  covalent  conjugation  to  other  proteins,  usually  in 
the  form  of  a  multi-Ub  chain,  marks  these  proteins  for  processive 
degradation  by  the  26S  proteasome,  an  ATP-dependent  multi¬ 
subunit  protease  (3,  6-8).  The  conjugation  of  Ub  to  other 
proteins  involves  the  formation  of  a  thioester  between  the  C 
terminus  of  Ub  and  a  specific  cysteine  of  the  Ub-activating  (El) 
enzyme.  The  Ub  moiety  of  El'-^Ub  thioester  thereafter  is 
transesterified  to  a  specific  cysteine  in  one  of  several  Ub- 
conjugating  (E2)  enzymes.  The  Ub  moiety  of  E2~Ub  thioester 
is  conjugated,  via  the  isopeptide  bond,  to  the  c-amino  group  of 
either  a  substrate's  Lys  residue  or  a  Lys  residue  of  another  Ub 
moiety,  the  latter  reaction  resulting  in  a  substrate-linked 
multi-Ub  chain  (9-11). 

Most  E2  enzymes  function  in  complexes  with  proteins  called  E3. 
The  functions  of  E3s  include  the  initial  recognition  of  degradation 
signals  (degrons)  in  the  substrate  proteins,  with  different  E3s 
recognizing  different  classes  of  degrons.  The  E2-E3  complexes, 
referred  to  as  Ub  ligases,  mediate  the  formation  of  substrate-linked 
multi-Ub  chains  (10, 12, 13).  The  ATP-dependent  26S  proteasome, 
which  processively  degrades  a  targeted,  ubiquitylated"^  substrate, 
consists  of  the  20S  core  protease  and  two  19S  particles  (6,  14).  A 
19S  particle  mediates  the  binding  and  unfolding  of  a  substrate 
protein  before  its  transfer  to  the  interior  of  the  20S  core  (6, 15-19). 

It  is  unclear  how  a  targeted  substrate  is  delivered  to  the  26S 
proteasome,  inasmuch  as  RpnIOp  (Mcblp/Sunlp),  the  only  known 
proteasomal  subunit  that  binds  multi-Ub  chains  (20),  is  not  essential 
for  degradation  of  many  proteins  in  the  yeast  Saccharomyces 
cerevisiae  (21). 

Here  we  show  that  Ubrip  and  Ufd4p,  the  E3  components  of 
two  distinct  Ub-dependent  proteolytic  pathways  (13,  22,  23), 
directly  interact  with  the  26S  proteasome.  Specifically,  Ubrip  is 
shown  to  bind  to  the  Rpn2p  (Sen3p),  Rptip  (Cim5p),  and  Rpt6p 
(Cim3p/Suglp)  proteins  of  the  19S  particle,  and  Ufd4p  is  shown 


to  bind  to  Rpt6p.  These  results  suggest  a  mechanism  for  the 
delivery  of  protein  substrates  to  the  proteasome. 

Materials  and  Methods 

Protein  Expression  in  S.  cerevisiae  and  Escherichia  co/i.  Plasmid 
construction  protocols  are  available  on  request.  S,  cerevisiae  RPNl 
was  isolated  as  a  multicopy  suppressor  of  the  toxicity  of  co¬ 
overproduced  Ubrip  and  Rad6p  (Ubc2p),  which  were  expressed 
from  the  bidirectional  Pgau,io  promoter  as  described  (24).  RPN2, 
RPNIO,  and  RPN12  with  their  own  promoter  regions,  and  the  ORFs 
oiRPNS,  RPN11,RPT1,  RPT2,  RPT6,  andPREb  were  amplified  by 
PCR  from  the  DNA  of  5.  cerevisiae  YPH500.  All  PCR  products 
were  verified  by  DNA  sequencing.  For  the  toxicity  suppression 
assays  (Fig.  1),  RPNl,  RPNl,  RPNIO,  and  RPNI2  were  subcloned 
into  the  high-copy  vector  pRS425  (25).  The  RPTI  and  RPT6  ORFs 
were  subcloned  into  the  low-copy  vector  pRS314CUPl  derived 
frompRS314  (25),  yielding  p314CUPlRPTl  andp314CUPlRPT6, 
in  which  RPTI  and  RPT6  were  expressed  from  the  Pcupi  promoter. 
For  coimmunoprecipitation  assays,  the  N-termini  of  Rpt6p,  Pre6p, 
and  Ufd4p  were  extended  with  the  hemagglutinin  (HA)  epitope 
(23);  alternatively,  the  FLAG  epitope  was  added  to  the  C  termini 
of  Rpt6p  and  Rpnlp  (26).  For  glutathione  5-transferase  (GST)- 
pulldown  assays,  the  ORFs  ofRPNI,  RPN2,  RPN3,  RPNIO,  RPNll, 
RPN12,  RPTI,  RPT2,  RPT6,  and  PRE6  were  fused  in-frame  to  the 
3'  end  of  GST-coding  sequence  in  pGEX-4T-3  (Amersham  Phar¬ 
macia).  The  C-terminally  FLAG-tagged  alleles  of  RPNl  and 
RPNIO  were  subcloned  into  pET-llc  (Novagen).  E.  coli  BL21 
(DE3)  (26)  was  used  to  express  the  GST  fusions,  as  well  as 
Rpnlp-FLAG  and  RpnlOp-FLAG, 

Pulse-Chase  and  GST-Pulldown  Assays.  The  Ub  protein  reference 
(UPR)-based  plasmids  expressing  dihydrofolate  reductase 
(DHFR)-HA-Ub^'^^-X-/3gal  (j3-galactosidase)  (see  Results  for 
notations)  were  described  (27).  The  pulse-chase  procedures 
(Fig.  2)  were  described  as  well  (22,  28).  In  GST-pulldown  assays, 
«==!  /jLg  of  a  GST  fusion  protein  or  GST  alone  was  diluted  to  0.5 
ml  in  the  loading  buffer  (1%  Triton  X-100/10%  glycerol/0.5  M 
NaCl/1  mM  EDTA/50  mM  TriS'HCl,  pH  8.0),  and  incubated 
with  10  [A  (bed  volume)  of  the  glutathione-agarose  beads 
(Sigma)  for  1  h  at  4°C.  The  beads  were  washed  three  times  with 


Abbreviations:  Ub,  ubiquitin;  HA,  hemagglutinin;  GST,  glutathione  S-transf erase;  UPR,  Ub 
protein  reference;  DHFR,  di hydrofolate  reductase;  ^gal,  /3-gaIactosidase;  UFD,  Ub  fusion 
degradation. 

*To  whom  reprint  requests  should  be  addressed  at:  Division  of  Biology,  147-75,  California 
Institute  of  Technology,  1200  East  California  Boulevard,  Pasadena,  CA  91125.  E-mail; 
avarsh@caltech.edu. 

^Ubiquitin  whose  C-terminal  (Gly-76)  carboxyl  group  is  covalently  linked  to  another  com¬ 
pound  is  called  the  ubiquityl  moiety,  the  derivative  terms  being  ubiquitylation  and 
ubiquitylated.  The  term  Ub  refers  to  both  free  ubiquitin  and  the  ubiquityl  moiety.  This 
nomenclature  (5),  which  also  is  recommended  by  the  Nomenclature  Committee  of  the 
International  Union  of  Biochemistry  and  Molecular  Biology  (41),  brings  ubiquitin-related 
terms  in  line  with  the  standard  chemical  terminology. 
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UBR1/UBC2  UBR1/UBC2+RPT1  UBR1/UBC2+RPN1+RPT1 


UBR1/UBC2+RPN1  UBR1/UBC2+RPT6  UBR1/UBC2+RPNURPT6 


Fig.  1.  Toxicity  of  overexpressed  N-end  rule  pathway  is  decreased  by  overexpressed  Rpnip  and  enhanced  by  overexpressed  Rptip  or  Rpt6p.  The  5.  cerevisiae 
strain  JD52  (23)  was  transformed  with  pKIVl1313,  a  low-copy,  URAS-based  plasmid  expressing  UBR1  and  from  the  bidirectional  Pgali.io  promoter  (24).  The 
pKM  1 3 1 3-contain  ing  JD52  cells  were  transformed  either  with  the  pRS425  vector  {A),  RPN1  (expressed  from  its  own  promoter  in  pRS425)  (S),  RPT1  (expressed  from 
the  PcuPi  promoter  in  p314CUP1RPTl)  (Q,  or  RPT6  (expressed  from  the  Pcupi  promoter  in  p314CUP1RPT6)  (D).  (f  and  F)  JD52  cells  carrying  both  pKM1313  and 
the  /?PA/7-expressing  plasmid  were  further  transformed  with  p314CUP1  RPT1  (£)  and  p314CUP1RPT6  (f).  respectively.  Shown  here  are  plates  with  cells  grown  in 
the  galactose-containing  minimal  medium  containing  0.2  mM  CuS04for  4  days  at  30°C  On  dextrose-containing  plates  (no  overexpression  of  Ubr1  p  and  RadSp), 
all  transformants  grew  at  similar  rates,  irrespective  of  the  presence  of  overexpressed  Rpnip,  Rpti  p,  or  RptSp  (data  not  shown). 


1  ml  of  the  binding  buffer  (0.05%  Triton  X-100/10%  glycer¬ 
ol/  50  mM  NaCl/ 50  mM  Na-Hepes,  pH  7.5).  The  washed  beads 
were  incubated  with  cell  extracts  containing  FLAG-Ubrlp  (Fig. 
3A),  or  with  purified  FLAG-Ubrlp  (Fig.  3B),  or  with  cell  extracts 
containing  either  Rpnlp-FLAG  or  RpnlOp-FLAG  (Fig.  4 A  and 
B),  or  HA-Ufd4p  (Fig.  5A)  at  4°C  for  2  h.  The  incubation  buffer 
for  the  assay  with  purified  FLAG-Ubrlp  contained  ovalbumin 
(Sigma)  at  1  mg/ml  in  the  binding  buffer.  The  beads  were 
washed  three  times  with  the  binding  buffer,  followed  by 
SDS/PAGE  of  the  retained  proteins  and  immunoblotting  with 


ABC 

-Vector  RPN1  vector  RPNl 


lu  su  u  jg  JU  0  TO  30  0  10  30  0 - ^0 - 20 - 

Chase  time  (min)  Chase  time  (min)  Chase  time  (min) 


Fig.  2.  Overexpression  of  Rpnip  Inhibits  degradation  of  N-end  rule  sub¬ 
strates.  A  U PR-based  pulse- chase  assay  was  used  (see  Results).  S.  cerevisiae 
JD52  {UBR1)  ceils  coexpressing  either  DHFR-HA-Ub'^^^-Leu-jSgal  (A)  or  DHFR- 
HA-Ubf^^8.His-^gaI  {B),  and  either  Rpn  1  p  (expressed  from  its  own  promoter  in 
pRS425)  or  pRS425  alone  were  labeled  with  P^S]methionine  for  5  min  at  30°C, 
followed  by  a  chase  for  0,  10,  and  30  min.  Cell  extracts  were  immunoprecipi- 
tated  with  both  anti-)3gal  and  anti-HA  antibodies,  followed  by  SDS/12% 
PAGE,  autoradiography,  and  quantitation  using  Phosphorlmager  (Molecular 
Dynamics).  (0  His-/3gal  and  Leu-/3gal  decay  curves  calculated  from  the  UPR- 
based  data  In  A  and  B  as  described  (28,  32).  The  bands  of  X-jSgals  and  the 
reference  protein  DHFR-HA-Ub*^'*^  ("ref")  are  indicated. 


either  anti-FLAG  antibody  (Sigma)  or  anti-HA  antibody 
(Babco,  Berkeley,  CA). 

Coimmunoprecipitation/lmmunoblottmg.  FLAG-Ubrlp,  bearing 
the  N-terminal  FLAG  epitope,  was  expressed  from  the  Padhi 
promoter  in  the  low-copy  vector  pRS315  (25).  The  N-terminally 
HA-tagged  HA-Rpt6p,  HA-Pre6p,  and  HA-Ufd4p  and  the 
C-terminally  FLAG-tagged  Rpt6p-FLAG  were  expressed  from 
the  induced  Pcupj  promoter  in  the  low-copy  vector  pRS314  (25). 
The  C-terminally  FLAG-tagged  Rpnlp-FLAG  was  expressed 
from  its  own  promoter  in  pRS425.  S.  cerevisiae  was  grown  in 
synthetic-dextrose  medium  (26, 28)  containing  0.2  mM  CUSO4  to 
ODsoo  *=*1-0.  Cells  were  disrupted  by  vortexing  with  glass  beads 
in  0.05%  Triton  X-100, 50  mM  NaCl,  50  mM  Na-Hepes  (pH  7.5) 
containing  several  protease  inhibitors  (27), 

Results 

Ubr1p,  the  E3  of  the  N-End  Rule  Pathway,  Binds  to  the  Proteasome. 

Ubrlp  (also  called  N-recognin),  the  225K  E3  of  the  S.  cerevisiae 
N-end  rule  pathway,  targets  proteins  that  bear  certain  (desta¬ 
bilizing)  N-terminal  residues.  Two  substrate-binding  sites  of 
Ubrlp  recognize  two  classes  of  destabilizing  N-terminal  resi¬ 
dues:  basic  and  bulky  hydrophobic  (13).  Yet  another  substrate¬ 
binding  site  of  Ubrlp  targets  proteins  that  bear  internal  (non- 
N- terminal)  degrons  (29).  Similar,  but  distinct,  versions  of  the 
N-end  rule  pathway  are  present  in  all  organisms  examined,  from 
prokaryotes  to  fungi  and  mammals  (13).  Previous  work  has 
shown  that  overexpression  of  the  pathway’s  Ub  ligase,  through 
co-overexpression  of  its  E2  (Rad6p,  also  called  Ubc2p)  and  E3 
(Ubrlp)  components,  is  toxic  to  S.  cerevisiae  (24). 

In  the  present  work,  we  found  that  RPNl  {NASI  /HRD2)  (30, 
31),  which  encodes  a  ^110  K  protein  of  the  proteasome’s  19S 
particle,  was  a  multicopy  suppressor  of  the  toxicity  of  co- 
overexpressed  Ubrlp  and  Rad6p  (Fig.  1 A  and  B,  and  data  not 
shown).  Because  Rpnip  is  essential  for  cell  viability  (30),  it  could 
be  that  the  toxicity  of  co-overexpressed  Ubrlp  and  Rad6p 
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Fig.  3.  Ubrip,  the  E3  of  the  N-end  rule  pathway.  Is  physically  associated  with  the  26S  proteasome.  {A  and  B)  Ubrip  interacts  with  Rpn2p,  Rptip,  and  Rpt6p 
in  GST-pulldown  assays.  Extracts  of  S.  cerevisiae  containing  overexpressed  FLAG-Ubr1p  (A)  or  the  purified  FLAG-Ubrip  protein  (B)  were  Incubated  with 
glutathione-agarose  beads  preloaded  with  the  indicated  GST  fusions.  The  retained  proteins  were  eluted,  fractionated  by  SDS/8%  PAGE,  and  Immunoblotted 
with  anti-FLAG  antibody.  Approximately  equal  amounts  of  different  GST  fusions  were  Immobilized  on  glutathione-agarose  beads  in  these  assays,  as  verified  by 
Coomassie  staining  (data  not  shown).  (C  and  D)  In  vivo  association  of  Ubrip  and  Rpt6p.  Extracts  of  S.  cerevisiae  AVY107  {ubrIA)  expressing  either  both 
FLAG-Ubr1p  and  HA-Rpt6p,  or  FLAG-Ubrip  alone,  or  HA-RptGp  alone  were  Incubated  with  anti-FLAG  antibody  (O  or  anti-HA  antibody  (D).  The  immunopre- 
cipitated  proteins  were  separated  by  SDS/12%  PAGE  and  transferred  to  nitrocellulose  membrane.  The  top  halves  of  Cand  D  show  the  results  of  immunoblotting 
with  anti-FLAG  antibody;  the  bottom  halves  show  the  analogous  data  with  anti-HA  antibody,  (f)  Coimmunoprecipitation  of  Pre6p  and  Ubrip.  Extracts  of 
S.  cerevisiae  AVY107  {ubr1A)  expressing  both  FLAG-Ubr1  p  and  HA-Pre6p,  FLAG-Ubri  p  alone,  or  HA-Pre6p  alone  were  incubated  with  anti-HA  antibody,  followed 
by  the  Immunopreclpitation/immunoblotting  described  In  C  and  D. 


resulted  in  part  from  overdegradation  of  Rpnlp,  which  would 
have  accounted  for  the  observed  alleviation  of  toxicity  through 
overexpression  of  Rpnlp.  However,  pulse-chase  assays  with 
Rpnlp  have  shown  it  to  be  a  long-lived  protein  in  either 
wild-type  cells  or  cells  co-overexpressing  Ubrip  and  Rad6p 
(data  not  shown).  It  also  was  found  that  overexpression  of  Rpnlp 
had  no  effect  on  the  expression  levels  of  co-overexpressed  Ubrip 
and  Rad6p  (data  not  shown). 

We  then  tested,  using  pulse-chase  assays,  whether  overex¬ 
pression  of  Rpnlp  perturbed  the  degradation  of  normally  short- 
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Fig.  4.  Rpnlp  interacts  with  Rptip,  RptGp,  and  RpnIOp.  f.  co//-expressed 
Rpnl  p-FLAG  (A)  and  Rpn10p-FLAG  (S),  both  carrying  the  FLAG  epitope  at  the  C 
terminus,  were  incubated  with  glutathione-agarose  beads  preloaded  with  the 
indicated  GST  fusions.  The  retained  proteins  were  fractionated  by  SDS/PAGE, 
followed  by  Immunoblotting  with  anti-FLAG  antibody.  Approximately  equal 
amounts  of  different  GST  fusions  were  immobilized  on  glutathione-agarose 
beads  in  these  assays,  as  verified  by  Coomassie  staining  (data  not  shown). 


lived  substrates  of  the  N-eiid  rule  pathway.  In  these  assays,  the 
E.  coli  /3gai-based  test  proteins  were  expressed  as  components  of 
fusions  of  the  form  DHFR-HA-Ub^'^^-X-jSgal,  where  X  was 
either  His,  a  type  1  (basic)  destabilizing  residue,  or  Leu,  a  type 
2  (hydrophobic)  destabilizing  residue  (13);  DHFR-HA  was  the 
C-terminally  HA-tagged  mouse  DHFR;  Ub^"^^  was  the  Ub 
moiety  containing  Arg  instead  of  Lys  at  position  48  (27,  28,  32); 
and  jSgal  was  E.  coli  Pgdl,  Deubiquitylating  enzymes  (33)  co- 
translationally  cleave  these  fusions  at  the  Ub-j3gal  junction, 
yielding  the  long-lived  UnFR-HA-Ub^"^^  reference  protein  and 
a  test  protein,  either  His-j8gal  or  Leu-j3gal.  A  reference  protein 
improves  the  assay's  accuracy  by  serving  as  a  built-in  control  for 
variations  in  the  expression  levels,  immunoprecipitation  effi¬ 
ciency,  and  other  sources  of  data  scatter  in  a  pulse -chase  assay. 
The  above  method  for  producing  a  reference  protein  is  called  the 
UPR  technique  (28,  32).  As  shown  in  'Fig.  2,  the  in  vivo 
degradation  of  both  His-)3gal  and  Leu-j8gal  substrates  of  the 
N-end  rule  pathway  was  significantly  impaired  in  the  presence  of 
overexpressed  Rpnlp. 

To  determine  whether  other  proteins  of  the  19S  particle  were 
similar  to  Rpnlp  in  its  ability  to  suppress  the  toxicity  of 
overexpressed  N-end  rule  pathway,  we  overexpressed  Rpn2p, 
RpnIOp,  and  Rpnl2p,  by  transforming  cells  that  co-overex- 
pressed  Ubrip  and  Rad6p  (from  the  Pgali.io  promoter)  with  a 
high-copy  vector  expressing,  separately,  Rpn2p,  RpnIOp,  and 
Rpnl2p  from  their  own  promoters.  We  also  tested  Rptip  and 
Rpt6p,  two  of  the  six  ATPases  of  the  19S  particle  (16),  by 
overexpressing  them  from  the  copper-induced  Pcupi  promoter 
in  a  low-copy  vector.  In  contrast  to  the  results  with  Rpnlp  (Fig. 
1  A  and  B),  none  of  these  proteins  could  alleviate,  upon 
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Fig.  5.  Ufd4p,  the  E3  of  the  UFD  pathway,  is  physically  associated  with  the 
proteasome.  (4)  Ufd4p  interacts  with  Rpt6p  in  the  GST-pulldown  assay.  Ex¬ 
tracts  of  S.  cerevisiae  containing  overexpressed  HA-Ufd4p  were  incubated 
with  glutathione-agarose  beads  preloaded  with  different  GST  fusions,  as 
indicated.  The  retained  proteins  were  eluted,  fractionated  by  5DS/8%  PAGE, 
and  immunoblotted  with  antl-HA  antibody.  Approximately  equal  amounts  of 
different  GST  fusions  were  immobilized  on  glutathione-agarose  beads  in 
these  assays,  as  verified  by  Coomassie  staining  (data  not  shown).  (S)  In  vivo 
association  of  Ufd4p  and  RptGp.  Extracts  of  5.  cerevisiae  JD52  {UBR1)  express¬ 
ing  both  HA-Ufd4p  and  Rpt6p-FLAG,  HA-Ufd4p  alone,  or  Rpt6p-FLAG  alone 
were  incubated  with  anti-HA  antibody.  The  immunoprecipitated  proteins 
were  separated  by  SDS/10%  PAGE  and  transferred  to  nitrocellulose  mem¬ 
brane.  (Upper)  The  results  of  immunoblotting  with  anti-HA  antibody.  {Lower) 
The  data  with  anti-FLAG  antibody.  (O  Coimmunoprecipitation  of  Rpnip  and 
Ufd4p.  Extracts  of  5.  cerevisiae  JD52  expressing  both  HA-Ufd4p  and  Rpnlp- 
FLAG,  HA-Ufd4p  alone,  or  Rpn1p-FLAG  alone  were  incubated  with  anti-HA 
antibody,  followed  by  the  Immunoprecipitation/immunoblotting  procedure 
described  in  B,  except  that  SDS/7%  PAGE  was  used. 


overexpression,  the  growth  defect  caused  by  co-overexpressed 
Ubrlp  and  Rad6p  (data  not  shown).  Moreover,  we  found  that 
overexpression  of  either  Rptlp  or  Rpt6p  had  an  effect  opposite 
to  that  of  overexpressed  Rpnip:  the  toxicity  of  co-overexpressed 
Ubrlp  and  Rad6p  was  higher  in  the  presence  of  overexpressed 
Rptlp  or  Rpt6p  (Fig.  1  C  and  D;  compare  with  Fig.  1 A  and  B), 
In  a  control  experiment,  the  over  expression  of  Rptlp  or  Rpt6p 
in  cells  that  did  not  co-overexpress  Ubrlp  and  Rad6p  was  not  at 
all  toxic  (data  not  shown).  These  results  suggested  that  Rptlp 
and  Rpt6p  may  interact  with  a  component(s)  of  the  N-end  rule 
pathway,  perhaps  with  Ubrlp,  the  pathway’s  E3.  To  verify  this 
conjecture,  we  carried  out  GST-pulldown  assays. 

In  these  experiments,  Ubrlp  was  N-terminally  tagged  with  the 
FLAG  epitope.  The  N-terminal  FLAG  tag  did  not  impair  the 
function  of  Ubrlp  (A.  Webster  and  A.V.,  unpublished  data). 
Rptlp  and  Rpt6p  were  expressed  in  E.  coli  as  fusions  to  the  C 
terminus  of  GST.  Extracts  of  S.  cerevisiae  overexpressing  FLAG- 
Ubrlp  were  incubated  with  glutathione-agarose  beads  pre- 
loaded  with  GST-Rptlp,  GST-Rpt6p,  or  GST  alone.  The  bound 
proteins  were  eluted,  fractionated  by  SDS/PAGE,  and  immu¬ 
noblotted  with  a  monoclonal  anti-FLAG  antibody.  Remarkably, 
GST-Rpt6p  was  found  to  bind  FLAG-Ubrlp;  a  smaller,  but 
significant,  amount  of  FLAG-Ubrlp  also  was  bound  by  GST- 
Rptlp,  whereas  no  FLAG-Ubrlp  was  bound  by  GST  alone  (Fig. 
3A,  lanes  2,  7,  and  8). 


Could  Ubrlp  interact  with  other  proteins  of  the  26S  protea¬ 
some  as  well?  Using  the  same  assay,  we  discovered  that  GST- 
Rpn2p  was  also  able  to  bind  FLAG-Ubrlp,  whereas  all  of  the 
other  tested  proteasomal  components  (GST-Rpnlp,  GST- 
Rpn3p,  GST-RpnlOp,  GST-Rpnllp,  GST-Rpnl2p,  GST-Rpt2p, 
and  GST-Pre6p,  the  latter  a  component  of  the  20S  core  protea¬ 
some)  did  not  bind  to  FLAG-Ubrlp  (Fig.  3/1),  Coomassie 
staining  of  the  eluted,  SDS/PAGE-fractionated  proteins  con¬ 
firmed  that  the  amounts  of  different  GST  fusions  prebound  to 
glutathione-agarose  beads  in  the  pulldown  assays  were  approx¬ 
imately  equal  (data  not  shown). 

To  determine  whether  Ubrlp  binds  to  Rptlp,  Rpt6p,  and  Rpn2p 
directly,  we  carried  out  pulldown  assays  with  purified  FLAG-Ubrlp 
(a  gift  from  F.  Du  and  A.  Webster,  California  Institute  of  Tech¬ 
nology)  (Fig.  3B).  FLAG-Ubrlp  was  overexpressed  in  5.  cerevisiae 
and  purified  in  three  consecutive  steps:  anti-FLAG  antibody  affin¬ 
ity  chromatography;  Rad6p  (E2)  affinity  chromatography;  and  gel 
filtration  (F.  Du,  A.  Webster,  and  A.V.,  unpublished  data).  In 
agreement  with  the  results  obtained  by  using  extracts  from  cells 
overexpressing  FLAG-Ubrlp  (Fig.  3/1),  the  purified  FLAG-Ubrlp 
bound  to  the  purified  GST-Rptlp,  GST-Rpt6p,  and  GST-Rpn2p, 
but  not  to  GST-Rpnlp,  GST-RpnlOp,  or  GST  alone  (Fig,  3B). 
FLAG-Ubrlp  used  in  this  assay  was  purified  from  5.  cerevisiae 
overexpressing  this  protein.  Although  FLAG-Ubrlp  was  at  least 
95%  pure  by  Coomassie  staining,  one  would  like  to  purify  5. 
cerevisiae  Ubrlp  from  a  noneukaryotic  host  as  well,  to  preclude  the 
unlikely  possibility  that  a  trace  amount  of  a  Ubrlp-interacting 
protein  in  the  preparation  of  purified  Ubrlp  could  mediate  the 
binding  of  Ubrlp  to  Rptlp,  Rpt6p,  and  Rpn2p.  Attempts  to 
overexpress  the  225K  Ubrlp  in  E.  coli  have  not  been  successful  thus 
far. 

Further  tests  used  a  coimmunoprecipitation/immunoblotting 
assay  to  examine  the  in  vivo  interaction  between  Ubrlp  and 
Rpt6p.  FLAG-Ubrlp  was  co-overexpressed  in  ubrl/\  S.  cerevi¬ 
siae  with  HA-Rpt6p,  which  bore  the  N-terminal  HA-derived 
epitope.  The  controls  included  congenic  cells  expressing  either 
FLAG-Ubrlp  alone  or  HA-Rpt6p  alone.  Proteins  were  immu¬ 
noprecipitated  from  cell  extracts  with  anti-FLAG  antibody, 
followed  by  SDS/PAGE  and  immunoblotting  with  either  an¬ 
ti-HA  antibody  (Fig.  3C  Lower)  or  anti-FLAG  antibody  (Fig.  3C 
Upper).  The  results  indicated  that  HA-Rpt6p  was  specifically 
coprecipitated  with  FLAG-Ubrlp  by  anti-FLAG  antibody.  We 
also  carried  out  the  reciprocal  coimmunoprecipitation  assay  and 
found  that  FLAG-Ubrlp  was  specifically  coprecipitated  with 
HA-Rpt6p  by  anti-HA  antibody  (Fig.  3D), 

In  the  in  vitro  pulldown  assays,  FLAG-Ubrlp  did  not  interact 
with  GST-Pre6p,  a  GST  fusion  to  a  protein  of  the  20S  core 
protease  (Fig.  3A,  lane  9).  To  determine  whether  Ubrlp  is 
associated  with  the  mature  26S  proteasome  in  vivo,  FLAG- 
Ubrlp  and  HA-Pre6p  (N-terminally  tagged  with  the  HA 
epitope)  were  co-overexpressed  in  ubrlHs  cells.  Proteins  were 
precipitated  from  cell  extracts  with  anti-HA  antibody  and  sep¬ 
arated  by  SDS/PAGE,  followed  by  immunoblotting  with  either 
anti-FLAG  or  anti-HA  antibody.  FLAG-Ubrlp  was  found  to  be 
specifically  coprecipitated  with  HA-Pre6p  by  anti-HA  antibody 
(Fig.  3D).  Thus,  GST-pulldown  assays  with  either  crude  or 
purified  FLAG-Ubrlp  (Fig.  3  A  and  B)  and  the  coimmunopre¬ 
cipitation  assays  (Fig.  3  C-E)  demonstrated  that  Ubrlp,  the  E3 
of  the  N-end  rule  pathway,  is  physically  associated  with  the  26S 
proteasome.  Moreover,  Ubrlp  was  found  to  interact  with  more 
than  one  protein  of  the  19S  particle  (Fig,  3  A  and  B). 

As  described  above,  overexpression  of  Rpnip,  a  non-ATPase 
component  of  the  19S  particle,  inhibited  the  activity  of  the  N-end 
rule  pathway  (Fig.  2).  What  might  be  the  mechanism  of  this 
inhibition?  Overexpression  of  Rpnip  apparently  did  not  have  a 
significant  effect  on  the  function  of  the  26S  proteasome,  inas¬ 
much  as  cells  overexpressing  Rpnip  were  not  hypersensitive  to 
either  an  arginine  analog  canavanine  or  high  temperature  (data 
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not  shown).  Hypersensitivity  to  these  stressors  is  characteristic  of 
yeast  strains  that  have  impaired  proteasome  components  (6, 14). 
Overexpression  of  Rpnlp  in  wild-type  cells  (in  the  absence  of 
co-overexpression  of  Ubrlp  and  Rad6p)  did  not  result  in  a 
growth  defect  (data  not  shown).  Furthermore,  GST-pulldown 
assays  indicated  that  GST-Rpnlp  did  not  directly  interact  with 
Ubrlp  (Fig.  3),  ruling  out  the  possibility  that  overproduced 
Rpnlp  may  decrease  the  toxicity  of  overexpressed  N-end  rule 
pathway  by  sequestering  the  overproduced  Ubrlp.  It  was  also 
unlikely  that  the  overproduced,  free  Rpnlp  competed  with  the 
26S  proteasome  by  interacting  with  substrate-linked  multi-Ub 
chains,  because  Rpnlp  was  unable  to  bind  isolated  multi-Ub 
chains  (data  not  shown). 

As  described  above,  overexpression  of  either  Rptlp  or  Rpt6p, 
the  ATPase  subunits  of  the  19S  particle,  was  found  to  increase 
the  toxicity  of  co-overexpressed  Ubrlp  and  Rad6p  (Fig.  1  C  and 
D;  compare  with  Fig.  lA).  Therefore,  we  examined  the  possi¬ 
bility  that  the  overexpressed  Rpnlp  may  alleviate  the  toxicity  of 
co-overexpressed  Ubrlp  and  Rad6p  by  sequestering  Rptlp  and 
Rpt6p,  both  of  which  were  shown  to  interact  with  Ubrlp  (Fig.  3). 
We  began  by  determining,  using  the  GST-pulldown  assay, 
whether  Rpnlp  could  directly  bind  to  Rptlp  and/or  Rpt6p. 
S.  cerevisiae  Rpnlp-FLAG  (bearing  the  C-terminal  FLAG 
epitope)  was  expressed  in  E.  coli  and  incubated  with  glutathione 
agarose  beads  containing  prebound  GST-Rptlp,  GST-Rpt6p,  or 
GST  alone.  The  bound  proteins  were  eluted,  separated  by 
SDS/PAGE,  and  immunoblotted  with  anti-FLAG  antibody. 
Rpnlp  was  found  to  interact  directly  with  both  GST-Rptlp  and 
GST-Rpt6p,  but  not  with  GST  alone  (Fig.  AA,  lanes  1,  5,  and  6). 

We  also  found  that  Rpnlp-FLAG  directly  interacted  with 
GST-RpnlOp,  the  protein  of  the  19S  particle  that  binds  to 
multi-Ub  chains  (Fig.  AA),  Rpnlp-FLAG  did  not  interact,  in  this 
assay,  with  other  tested  proteins  of  the  19S  particle  (Rpn2p  and 
Rpnl2p)  and  also  did  not  interact  with  the  Pre6p  protein  of  the 
20S  core  (Fig.  AA).  A  direct  Rpnlp-RpnlOp  interaction  also  was 
observed  in  a  reciprocal  GST-pulldown  assay,  in  which  the 
E.  co//-expressed  RpnlOp-FLAG  was  shown  to  be  retained  by  the 
prebound  GST-Rpnlp,  but  not  by  GST-Rpn2p  (Fig.  AB).  Rpnlp, 
Rpn2p,  RpnlOp,  and  the  six  Rpt  ATPases  Rptlp-Rpt6p  form  the 
base  of  the  19S  particle  that  is  proximal  to  the  20S  core  of  the 
26S  proteasome  (16,  31). 

Further  tests  indicated  that  overexpression  of  either  Rptlp  or 
Rpt6p  antagonized  the  ability  of  overexpressed  Rpnlp  to  sup¬ 
press  the  toxicity  of  overexpressed  N-end  rule  pathway  (Fig.  1 E 
and  F\  compare  with  Fig.  IS).  Taken  together,  these  results 
(Figs.  1, 3,  and  4)  strongly  suggested  that  Rpnlp,  which  does  not 
bind  to  Ubrlp,  suppresses  the  toxicity  of  co-overexpressed 
Ubrlp  and  Rad6p  at  least  in  part  through  the  demonstrated 
binding  of  Rpnlp  to  both  Rptlp  and  Rpt6p,  either  of  which  binds 
to  Ubrlp.  Schauber  et  al  (34)  have  shown  that  Rad23p,  which 
bears  an  N-terminal  Ub-like  moiety,  is  another  high-copy  sup¬ 
pressor  of  the  toxicity  of  overexpressed  N-end  rule  pathway. 
Rad23p  was  found  to  interact  with  Rptlp,  Rpt6p,  and  RpnlOp 
(35,  36).  In  the  present  work,  the  same  three  proteins  of  the  19S 
particle  were  shown  to  interact  with  Rpnlp,  another  protein  of 
this  particle,  suggesting  that  overexpression  of  Rpnlp  and 
Rad23p  suppresses  the  toxicity  of  overexpressed  N-end  rule 
pathway  through  similar  mechanisms. 

Ufd4p,  the  E3  of  the  Ub  Fusion  Degradation  (UFD)  Pathway,  Binds  to 
the  Proteasome.  The  substrates  of  another  Ub-dependent  proteo¬ 
lytic  system,  termed  the  UFD  pathway,  include  proteins  bearing  at 
their  N  termini  a  ‘"nonremovable”  Ub  moiety  (2^  37, 38).  A  partial 
or  complete  resistance  of  these  Ub-containing  proteins  to  deubiq- 
uitylating  enzyme-mediated  cleavage  stems  from  either  alterations 
of  the  last  residue  of  Ub  moiety  or  the  presence  of  proline  at  the 
C-terminal  side  of  the  Ub-protein  junction  (23).  Ufd4p,  a  member 
of  the  HECT  family  of  E3  proteins  (39),  is  the  E3  of  the  S.  cerevisiae 


UFD  pathway  (23).  Sequence  comparisons  did  not  detect  statisti¬ 
cally  significant  similarities  between  the  225K  Ubrlp  and  the  167K 
'  Ufd4p.  We  used  the  GST-pulldown  and  coimmunoprecipita- 
tion/immunoblotting  assays  to  determine  whether  Ufd4p  also 
could  bind  to  the  proteasome. 

Ufd4p  was  N-terminally  tagged  with  the  HA  epitope,  which 
did  not  impair  the  function  of  Ufd4p  (data  not  shown).  In  the 
GST-pulldown  assay,  extracts  from  S.  cerevisiae  overexpressing 
HA-Ufd4p  were  incubated  with  glutathione-agarose  beads  pre- 
loaded  with  GST-Rpnlp,  GST-Rpn2p,  GST-Rpn3p,  GST- 
RpnlOp,  GST-Rpnllp,  GST-Rpnl2p,  GST-Rptlp,  GST-Rpt2p, 
GST-Rpt6p,  or  GST  alone.  The  retained  proteins  were  fraction¬ 
ated  by  SDS /PAGE  and  immunoblotted  with  anti-H  A  antibody. 
As  shown  in  Fig.  5A,  HA-Ufd4p  was  found  to  be  specifically 
bound  to  GST-Rpt6p,  one  of  three  proteins  of  the  19S  particle 
that  interacted  with  Ubrlp,  the  E3  of  the  N-end  rule  pathway. 
None  of  the  other  tested  proteasomal  proteins  were  able  to 
interact  with  HA-Ufd4p  in  the  GST-pulldown  assay  (Fig.  5^1). 

We  then  examined  the  in  vivo  association  of  Ufd4p  with 
Rpt6p.  HA-Ufd4p  was  co-overexpressed  in  5.  cerevisiae  with 
Rpt6p-FLAG,  which  bore  the  C-terminal  FLAG  epitope.  The 
controls  included  congenic  cells  expressing  either  HA-Ufd4p 
alone  or  Rpt6p-FLAG  alone.  Proteins  were  immunoprecipitated 
from  cell  extracts  with  anti-HA  antibody,  followed  by 
SDS/PAGE  and  immunoblotting  with  either  anti-HA  antibody 
(Fig.  5B  Upper)  or  anti-FLAG  antibody  (Fig.  5B  Lower).  The 
results  indicated  that  Rpt6p-FLAG  was  specifically  coprecipi¬ 
tated  with  HA-Ufd4p  by  anti-HA  antibody. 

If  Ufd4p  interacts  with  the  mature  proteasome,  one  would 
expect  Rpnlp,  which  did  not  directly  interact  with  Ufd4p  (Fig. 
5A),  to  be  coimmunoprecipitated  with  HA-Ufd4p.  To  test  this 
conjecture,  HA-Ufd4p  was  co-overexpressed  in  S.  cerevisiae  with 
Rpnlp-FLAG,  which  bore  the  C-terminal  FLAG  epitope.  The 
controls  included  congenic  cells  expressing  either  HA-Ufd4p 
alone  or  Rpnlp-FLAG  alone.  Extracts  were  immunoprecipi¬ 
tated  with  anti-HA  antibody,  followed  by  SDS/PAGE  and 
immunoblotting  with  either  anti-HA  antibody  (Fig.  5C  Upper)  or 
anti-FLAG  antibody  (Fig.  5C  Lower).  Rpnlp-FLAG  was  indeed 
specifically  coprecipitated  with  HA-Ufd4p  by  anti-HA  antibody. 
Taken  together,  the  GST-pulldown  and  coimmuiioprecipitation 
results  (Fig.  5)  demonstrated  that  Ufd4p,  the  E3  of  the  UFD 
pathway,  was  physically  associated  with  the  proteasome. 

Discussion 

It  has  been  unclear  how  the  Ub  system  delivers  its  substrates  to 
the  26S  proteasome  for  processive  degradation.  Two  structural 
elements  of  a  targeted,  Ub-conjugated  substrate  could  serve  as 
ligands  for  the  substrate’s  docking  at  the  26S  proteasome:  a 
substrate-linked  multi-Ub  chain  and/ or  a  cognate  E3-E2  com¬ 
plex  reversibly  bound  to  the  substrate’s  degron.  Thus  far, 
RpnlOp  is  the  only  proteasomal  component  known  to  bind 
multi-Ub  chains;  it  also  binds  to  chain-bearing  model  substrates 
(refs.  10,  20,  and  21,  and  our  unpublished  data).  However, 
S.  cerevisiae  stains  lacking  RpnlOp  were  found  to  be  impaired  in 
the  degradation  of  only  some  of  the  model  substrates  and  did  not 
exhibit  aberrant  phenotypes  characteristic  of  proteolysis- 
impaired  proteasome  mutants  (10,  21). 

Our  discovery  that  Ubrlp  and  Ufd4p,  the  E3  components  of, 
respectively,  the  N-end  rule  pathway  and  the  UFD  pathway, 
directly  interact  with  specific  proteins  of  the  26S  proteasome 
suggests  a  general  mechanism  for  the  delivery  of  targeted 
substrates  to  the  proteasome.  In  this  model,  the  E3  component 
of  a  Ub-dependent  pathway  binds  (as  an  E3-E2  complex)  to  a 
cognate  degron  of  a  target  protein  and  initiates,  either  sequen¬ 
tially  or  concurrently,  two  sets  of  processes:  (f)  the  formation  of 
a  substrate-linked  multi-Ub  chain,  through  the  activity  of  sub¬ 
strate-bound  Ub  ligase  (E3-E2)  complex,  and  («)  the  delivery  of 
an  E3-bound  substrate  to  the  chaperone-like  proteins  of  the  26S 
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proteasome,  through  interactions  between  the  E3  and  specific 
proteins  of  the  19S  particle.  A  key  assumption  of  this  model  is 
that  the  demonstrated  physical  affinity  between  an  E3  and  the 
proteasome  is  essential  for  the  activity  (at  least  for  the  normal 
level  of  activity)  of  an  E3-mediated  proteolytic  pathway.  In  the 
case  of  the  N-end  rule  pathway,  this  prediction  can  be  tested,  for 
example,  through  the  mapping  of  a  site(s)  in  the  225K  Ubrlp  that 
mediates  its  binding  to  Rpn2p,  Rptlp,  and  Rpt6p  of  the  19S 
particle.  The  site(s)  thus  identified  then  could  be  mutated 
in  ways  that  leave  intact  the  other  functions  of  Ubrlp,  such  as  its 
binding  to  substrates,  to  the  Rad6p  E2  enzyme,  and  the  previ¬ 
ously  described,  RING  finger-dependent  activity  of  Ubrlp  in  the 
Rad6p/Ubrlp-mediated  formation  of  a  substrate-linked 
multi-Ub  chain  (27).  A  similar  approach  could  be  used  to  test  the 
model’s  assumption  in  regard  to  the  demonstrated  Ufd4p-Rpt6p 
interaction. 

Specific  functions  of  either  the  Ufd4p-Rpt6p  interaction  or  the 
interactions  of  Ubrlp  with  Rpn2p,  Rptlp,  and  Rpt6p  of  the  19S 
particle  are  unknown,  in  part  because  it  is  largely  unknown  how  the 
proteins  of  this  particle  contribute  to  the  proteasome-mediated 
proteolysis.  It  has  been  shown  that  Rptlp-Rpt6p,  the  six  ATPases 
of  the  19S  particle,  are  not  functionally  redundant,  and  that  they 
cooperate  in  preparing  individual  substrates  for  degradation  by  the 
20S  core  of  the  proteasome  (40).  It  is  likely  that  the  functional  role 
of  the  Ubrlp-Rpn2p  interaction  is  different  from  those  of  the 
l^rlp-Rptlp  and  Ubrlp-Rpt6p  interactions,  because  overexpres¬ 
sion  of  Rpn2p  was  found  to  have  virtually  no  effect  on  the  toxicity 
of  co-overexpressed  Ubrlp  and  Rad6p,  whereas  overexpression  of 
Rptlp  and  Rpt6p  increased  this  toxicity  (see  above).  One  possibility 
is  that  the  non-ATPase  Rpn2p  protein  of  the  19S  particle  functions 
as  the  main  Ubrlp  affinity  anchor,  whereas  the  Ubrlp-binding 
ATPases  Rptlp  and  Rpt6p  mediate  the  unfolding  of  a  Ubrlp- 
bound  substrate  before  its  translocation  into  the  20S  core.  If  so,  the 
observed  increase  in  the  toxicity  of  overexpressed  N-end  rule 
pathway  in  the  presence  of  free  (overproduced)  Rptlp  or  Rpt6p 
may  result  from  their  binding  to  the  Ubrlp  moiety  of  the  Ubrlp- 
Rad6p  complex  and  their  facilitation  of  the  unfolding  of  Ubrlp- 
bound  substrates  before  docking  at  the  proteasome.  Given  the 
differences  between  Ubrlp  and  Ufd4p  E3s  in  regard  to  their  ligands 
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in  the  19S  particle  (unlike  Ubrlp,  the  Ufd4p  E3  did  not  bind  to 
either  Rptlp  or  Rpn2p),  it  should  be  of  interest  to  determine 
whether  Rpt6p  is,  in  fact,  the  only  significant  ligand  of  Ufd4p  in  the 
proteasome. 

Several  aspects  of  the  proposed  delivery  mechanism  remain 
unconstrained  by  the  available  evidence.  For  example,  if  the 
demonstrated  Ubrlp-proteasome  and  Ufd4p-proteasome  interac¬ 
tions  (Figs.  3-5)  prove  relevant  to  the  functional  activity  of  the 
N-end  rule  and  UFD  pathways,  one  would  like  to  determine  how 
many  of  the  distinct  E3s  in  a  cell  interact  with  the  proteasome  and 
what  proteins  of  the  proteasome  these  E3s  bind  to.  Another 
important  question  is  whether  the  Ubrlp-dependent  formation  of 
a  substrate-linked  multi-Ub  chain  is  required,  in  vivo,  for  the 
docking  of  a  Ubrlp-bound  substrate  at  the  Ubr  1-binding  site  of  the 
proteasome.  An  alternative  model  is  that  these  two  Ubrlp- 
mediated,  substrate-centered  processes  (the  chain  formation  and 
the  docking  at  the  proteasome)  take  place  concurrently  and 
independently. 

The  previously  proposed  function  of  a  substrate-linked 
multi-Ub  chain  is  to  serve  as  a  dissociation-slowing  device  (9). 
Specifically,  if  the  rate-limiting  step  that  precedes  the  first 
proteolytic  cleavages  of  a  proteasome-bound  substrate  is  the 
substrate’s  unfolding  by  chaperones  of  the  19S  particle,  then  a 
decrease  in  the  rate  of  dissociation  of  the  proteasome-substrate 
complex,  brought  about  by  the  multi-Ub  chain,  should  facilitate 
substrate’s  degradation:  the  longer  the  allowed  '‘waiting”  time, 
the  greater  the  probability  of  a  required  unfolding  event  (13)! 
Similar  considerations  may  apply  to  the  function  of  E3-mediated 
binding  of  a  substrate  to  the  proteasome. 

The  main  result  of  this  work  is  that  the  E3  components  of  two 
distinct  Ub  ligases  are  physically  associated  with  specific  proteins 
of  the  26S  proteasome.  We  discussed  the  mechanistic  and 
functional  implications  of  this  advance. 
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Detecting  and  Measuring 
Cotranslational  Protein 
Degradation  in  Vivo 

Glenn  C.  Turner  and  Alexander  Varshavsky 

Nascent  polypeptides  emerging  from  the  ribosome  and  not  yet  folded  may  at 
least  transiently  present  degradation  signals  similar  to  those  recognized  by  the 
ubiquitin  system  in  misfolded  proteins.  The  ubiquitin  sandwich  technique  was 
used  to  detect  and  measure  cotranslational  protein  degradation  in  living  cells. 
More  than  50  percent  of  nascent  protein  molecules  bearing  an  amino-terminal 
degradation  signal  can  be  degraded  cotranslationally,  never  reaching  their 
mature  size  before  their  destruction  by  processive  proteolysis.  Thus,  the  folding 
of  nascent  proteins,  including  abnormal  ones,  may  be  in  kinetic  competition 
with  pathways  that  target  these  proteins  for  degradation  cotranslationally. 


Nascent  polypeptides  emerging  from  the  ri¬ 
bosome  may,  in  the  process  of  folding, 
present  hydrophobic  patches  and  other  struc¬ 
tural  features  that  serve  as  degradation  sig¬ 
nals  similar  to  those  recognized  by  the  ubiq¬ 
uitin  (Ub)  system  in  misfolded  or  otherwise 
damaged  proteins  (I).  Whether  a  substantial 
fraction  of  nascent  polypeptides  is  cotransla¬ 
tionally  degraded  is  a  long-standing  question. 

The  Ub  sandwich  technique  was  devel¬ 
oped  to  detect  cotranslational  protein  degra¬ 
dation  by  measuring  the  steady-state  ratio  of 
two  reporter  proteins  whose  relative  abun¬ 
dance  is  established  cotranslationally.  The 
technique  requires  that  the  polypeptide  to  be 
examined  for  cotranslational  degradation, 
termed  B,  be  sandwiched  between  two  stable 
reporter  domains,  A  and  C,  in  a  linear  fusion 
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protein  (Fig.  lA).  The  three  polypeptides  are 
connected  by  Ub  moieties,  creating  an  AUb- 
BUb-CUb  fusion  protein.  Ub-specific  pro¬ 
cessing  proteases  (UBPs)  cotranslationally 
cleave  such  linear  Ub  fusions  at  the  C-termi- 
nal  residue  of  Ub  (2-4),  generating  three 
independent  polypeptides,  AUb,  BUb,  and 
CUb  (5).  UBP-mediated  cleavage  establishes 
a  kinetic  competition  between  two  mutually 
exclusive  events  during  the  synthesis  of 
AUb-BUb-CUb:  cotranslational  UBP  cleav¬ 
age  at  the  BUb-CUb  junction  to  release  the 
long-lived  CUb  module  or,  alternatively,  co¬ 
translational  degradation  of  the  entire  BUb- 
CUb  nascent  chain  by  the  proteasome  (6) 
(Fig.  IB).  In  the  latter  case,  the  processivity 
of  proteasome-mediated  degradation  results 
in  the  destruction  of  the  Ub  moiety  between 
B  and  C  before  it  can  be  recognized  by  UBPs. 
The  resulting  drop  in  levels  of  the  CUb  mod¬ 
ule  relative  to  levels  of  AUb,  referred  to  as 
the  C/A  ratio,  reflects  the  cotranslational  deg¬ 
radation  of  domain  B  (Fig.  IB). 
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Fig.  3.  Nascent  poly-  A 
peptides  bearing  an  N- 
termlnal  degradation  g  =  r 
signal  can  be  degraded 
cotranslatlonally.  (A) 
Determination  of  C/A 
ratios  through  immuno-  ; 

precipitation  of  in  vivo- 
labeled  AUb-BUb-CUb  ^ 
fusion  proteins  (14). 

Two  variants  of  pgal 
were  used  as  domain  B,  one  carrying 
an  N-degron  (Re'^-Pgal;  ty^  ^  2 
min)  and  one  lacking  this  degrada¬ 
tion  signal  (Me'^-pgal;  ti/2  >  20 
hours).  Each  pair  of  lanes  corre¬ 
sponds  to  two  independent  experi¬ 
ments.  (B)  Immunoblot  analysis  of 
the  C/A  levels  for  different  B  do¬ 
mains  (described  in  Fig.  2A).  The 
identity  of  the  N-terminal  residue  of 
each  of  the  B  domains  is  indicated 
above  each  lane.  F  marks  the  lane  in 
which  domain  B  was  the  18-residue 
FLAC-contalning  moiety. 
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Fig.  1.  The  ubiquitin  sandwich  technique.  (A) 
Organization  of  a  Ub  sandwich  fusion.  The 
polypeptide  assayed  for  cotranslational  degra¬ 
dation,  B,  is  sandwiched  between  two  stable 
reporter  domains  A  and  C.  Red  arrows  indicate 
the  locations  of  UBP  cleavage  sites.  (B)  The 
principle  of  the  method.  The  reporter  module 
AUb  is  the  first  synthesized  and  is  cotransla- 
tionally  released  from  B,  thereby  providing  a 
measure  of  the  number  of  nascent  B  chains 
that  initially  emerge  from  the  ribosome.  If  deg¬ 
radation  of  the  emerging  B  domain,  indicated 
by  its  insertion  into  the  proteasome,  is  strictly 
posttranslational,  UBP-mediated  cleavage  at 
the  BUb-C  junction  releases  CUb  before  B  is 
degraded,  so  the  molar  yields  of  CUb  and  AUb 
are  identical.  However,  if  degradation  of  B  can 
be  cotranslational,  a  substantial  fraction  of 
BUb-CUb  may  be  degraded  as  a  unit.  This  will 
result  in  the  molar  yield  of  CUb  being  lower 
than  AUb,  the  difference  being  a  measure  of 
cotranslational  degradation. 

To  verify  that  UBP-mediated  cleavage  is 
cotranslational  (3),  we  carried  out  in  vivo 
radiolabeling  in  which  the  labeling  pulse  was 
substantially  shorter  than  the  time  required 
for  the  complete  synthesis  of  AUb-BUb- 
CUb.  Saccharomyces  cerevisiae  cells  ex¬ 
pressing  the  fusion  protein  {DHFRhaUb}  - 
{Me^pgalUb}  -  {Me^DHFRhaUb}  (Fig. 
2A),  predicted  to  require  —350  s  for  complete 
synthesis,  were  radiolabeled  for  45  s  (7). 
Labeling  was  terminated  by  addition  of  cy- 
cloheximide,  and  UBPs  were  simultaneously 
inactivated  with  A-ethylmaleimide  (NEM) 
(3).  Under  these  conditions,  nascent  chains 
that  are  just  starting  to  be  synthesized  when 
the  pulse  begins  will  incorporate  label  into 
the  N-terminal  A  domain  but  do  not  elon¬ 
gate  to  full-length  chains  (Fig.  2B).  Thus, 


detection  of  free,  labeled  AUb  and  BUb  by 
immunoprecipitation  indicated  that  UBP- 
mediated  cleavage  at  the  AUb-BUb  junc¬ 
tion  was  cotranslational  (Fig.  2C).  No  full- 
length  AUb-BUb-CUb  fusion  was  detected 
(Fig.  2C),  indicating  that  cotranslational 


cleavage  by  UBPs  was  highly  efficient. 

Previous  experiments  bearing  on  cotrans¬ 
lational  degradation  used  inhibitors  or  cell- 
free  systems  {8-10).  Nascent  polypeptide 
chains  might  be  protected  from  degradation 
in  vivo,  either  because  they  are  sterically 
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Fig.  4.  The  extent  of  cotrans- 
lational  protein  degradation 
depends  on  the  presence  of  a 
degron,  the  activity  of  a  de- 
gron-specific  proteolytic 
pathway,  and  the  nature  and 
size  of  the  protein.  (A)  C/A 
ratios  obtained  for  the  dif¬ 
ferent  B  domains.  (B)  Ubrlp- 
dependent  cotranslational 
degradation  (75).  Each  bar 
represents  a  mean  value  de¬ 
rived  from  at  least  four  inde¬ 
pendent  experiments;  stan¬ 
dard  errors  are  indicated. 

Differences  between  means 
were  significant  to  at  least 

P  <  0.05  by  the  Mann-Whitney  test.  Bars  are  as  in  (A). 
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shielded  by  chaperones  or  because  their 
translation  time  is  short  compared  with  the 
time  required  for  targeting  by  the  degradation 
machinery.  The  Ub  sandwich  technique  was 
used  to  detect  in  vivo  cotranslational  degra¬ 
dation  of  a  118-kD,  p-galactosidase  (Pgal)- 
derived  polypeptide  carrying  a  strong  N-ter- 
minal  degradation  signal,  specifically  an  N- 
degron  (Fig.  2A).  The  pgal-linked  N-degron 
comprises  the  destabilizing  N-terminal  resi¬ 
due  arginine  (R)  and  a  short,  lysine  (K)- 
bearing  extension,  e^  (Fig.  2A),  that  is  the 
site  of  multi-Ub  chain  attachment  (//). 
Ubrlp,  the  E3  component  of  the  N-end  rule 
pathway,  targets  Re^-pgal  for  rapid  degrada¬ 
tion  in  vivo  [half-life  (7j^2)  ^  ^  Tnin]  (2,  12). 
Changing  the  N-terminal  residue  of  the  pro¬ 
tein  to  methionine  (M)  inactivates  the  degra¬ 
dation  signal  by  precluding  recognition  by 
Ubrlp.  The  resulting  Me^-pgal  is  posttrans- 
lationally  stable  (/^^^  ^  20  hours)  (2).  AUb- 
BUb-CUb  fusion  proteins  in  which  domain  B 
was  either  Re^-Pgal  or  its  N-  degron-lacking 
counterpart  Me*^-pgal  (Fig.  2A)  were  ex¬ 
pressed  in  S.  cerevisiae  strains  containing 
different  levels  of  Ubrlp  (75).  The  extent  of 
cotranslational  degradation  was  assessed  by 
radiolabeling  for  30  min  and  immunoprecipi- 
tation  {14)  to  determine  the  levels  of  CUb 
relative  to  AUb  (the  C/A  ratio). 

The  C/A  ratio  was  lower  in  cells  express¬ 
ing  the  N-degron- bearing  domain  B,  but 
only  in  those  strains  that  also  expressed 
Ubrlp  (Fig.  3 A).  To  determine  the  percent¬ 
age  of  nascent  chains  cotranslationally  de¬ 
graded  by  the  N-end  rule  pathway,  we  com¬ 
pared  the  C/A  ratios  for  Re*^-pgal  in  wild- 
type  (0.52)  and  Ubrlp-overexpressing  strains 
(0.39)  with  the  ratio  found  with  the  ubrlbk. 
strain  (0.86)  (Fig.  4A).  This  comparison  in¬ 
dicated  that  ~40%  of  the  nascent  Re*^-pgal 
chains  were  cotranslationally  degraded  in  the 
wild-type  strain  (Fig.  4B)  (75).  This  fraction 
increased  to  —55%  when  Ubrlp  was  overex¬ 
pressed  from  the  promoter. 

The  extent  of  cotranslational  degradation  of 
two  other  B  domains  of  different  sizes  was  also 
determined.  The  mammalian  Sindbis  virus 


RNA  polymerase,  termed  nsP4,  is  a  69-kD 
protein  that  naturally  bears  an  N-degron  (7d), 
and  Xe’^-Ura3p  (X  =  M  or  R)  is  a  34-kD 
enzyme  of  the  S.  cerevisiae  uracil  biosynthetic 
pathway  that  either  carries  (Re^-Ura3p)  or 
lacks  (Me’^-Ura3p)  an  N-degron  (7  7)  (Fig.  2A). 
Although  the  short-lived  R-nsP4  (69  kD)  and 
Re^-pgal  (118  kD)  were  cotranslationally  de¬ 
graded  to  similar  extents,  Re^-Ura3p  (34  kD), 
which  was  also  short-lived  posttranslationally 
(77),  exhibited  much  less  cotranslational  deg¬ 
radation  than  the  other  two  proteins  (Fig.  3B). 
Radiolabeling  and  immunoprecipitation  {14) 
showed  that  —50%  of  the  nascent  chains  of 
R-nsP4  were  cotranslationally  degraded  by  the 
N-end  rule  pathway  in  wild-type  cells  and 
—55%  in  cells  overexpressing  Ubrlp  (Fig.  4B). 
These  values  were  similar  to  —40%  and  —55% 
cotranslational  degradation  of  Re^-(3gal  in 
these  strains,  respectively  (Fig.  4B).  In  contrast, 
only  —20%  cotranslational  degradation  was  ob¬ 
served  with  the  34-kD  Re^-Ura3p  in  either 
wild-type  or  Ubrlp-overexpressing  strains  (Fig, 
4B).  These  results  suggest  that  smaller  proteins 
are  less  susceptible  to  cotranslational  degrada¬ 
tion;  however,  factors  other  than  translation 
time  likely  influence  the  presentation  or  acces¬ 
sibility  of  the  degradation  signal  by  the  nascent 
chain,  because  protein  size  was  not  directly 
proportional  to  the  extent  of  cotranslational 
degradation. 

These  comparisons  established  the  amounts 
of  cotranslational  degradation  by  the  N-end  rule 
pathway.  The  C/A  ratios  for  two  of  the  N- 
degron-lacking  B  domains,  Me^-pgal  (0.84) 
and  M-nsP4  (0.72),  were  less  than  the  C/A  ratio 
obtained  with  a  very  short  FLAG  epitope-con¬ 
taining  sequence  that  lacks  a  degradation  signal 
and  exhibits  a  C/A  ratio  of  0.97,  indistinguish¬ 
able  from  the  theoretical  value  of  1 .0  (Fig.  4A). 
This  suggests  that  —15%  of  Me^-pgal  and 
—25%  of  M-nsP4  nascent  chains  might  be  co¬ 
translationally  degraded  by  a  Ubrlp-indepen- 
dent  pathway.  Premature  termination  of  trans¬ 
lation  and/or  transcription  may  also  contribute 
to  the  drop  in  the  C/A  ratios  for  these  B  do¬ 
mains.  However,  it  is  unlikely  that  premature 
termination  of  translation  accounts  entirely  for 


the  difference  between  the  C/A  ratios  for  FLAG 
and  M-nsP4,  because  the  codon  adaptation  in¬ 
dex  of  the  M-nsP4  open  reading  frame  (0.1)  is 
higher  than  the  one  for  Me^-pgal  (0.07),  but 
the  drop  in  C/A  ratio  is  greater  with  M-nsP4 
than  with  Me^-pgal.  The  degradation  of  newly 
synthesized  polypeptides  is  a  major  source  of 
peptides  presented  to  the  immune  system  {18, 
19).  Our  observations  with  the  above  two  B 
domains  suggest  that  the  source  of  these  pep¬ 
tides  is  cotranslational  protein  degradation.  In 
this  regard,  it  is  interesting  that  nsP4  is  a  viral 
protein  that  is  presented  efficiently  to  the  im¬ 
mune  system  by  infected  cells. 

The  extent  of  cotranslational  degradation 
can  be  strikingly  high:  In  the  case  of  Re^- 
pgal  in  Ubrlp-overexpressing  cells,  over 
50%  of  nascent  polypeptide  chains  never 
reach  their  full  size  before  their  destruction 
by  processive  proteolysis.  Thus,  if  a  nascent 
chain  displays  a  degron  of  the  Ub  system, 
such  a  protein  becomes  a  target  of  kinetic 
competition  between  cotranslational  biogen¬ 
esis  and  cotranslational  degradation.  Because 
the  folding  of  a  protein  molecule  begins  dur¬ 
ing  its  synthesis  on  the  ribosome,  a  nascent 
polypeptide  may  cotranslationally  expose 
degradation  signals  that  become  shielded 
through  the  folding  of  the  newly  formed  pro¬ 
tein  {20,  21).  Thus,  cotranslational  protein 
degradation  may  represent  a  form  of  protein 
quality  control  that  destroys  nascent  chains 
that  fail  to  fold  correctly  rapidly  enough. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein 
to  the  identity  of  its  N-terminal  residue.  We  used  an 
expression-cloning  screen  to  search  for  mouse  proteins 
that  are  degraded  by  the  ubiquitin/proteasome-depend- 
ent  N-end  rule  pathway  in  a  reticulocyte  lysate.  One 
substrate  thus  identified  was  RGS4,  a  member  of  the 
RGS  family  of  GTPase-activating  proteins  that  down- 
regulate  specific  G  proteins.  A  determinant  of  the  RGS4 
degradation  signal  (degron)  was  located  at  the  N  termi¬ 
nus  of  RGS4,  because  converting  cysteine  2  to  either 
glycine,  alanine,  or  valine  completely  stabilized  RGS4. 
Radiochemical  sequencing  indicated  that  the  N-termi- 
nal  methionine  of  the  lysate-produced  RGS4  was  re¬ 
placed  with  arginine.  Since  N-terminal  arginine  is  a  de¬ 
stabilizing  residue  not  encoded  by  RGS4  mRNA,  we 
conclude  that  the  degron  of  RGS4  is  generated  through 
the  removal  of  N-terminal  methionine  and  enzymatic 
arginylation  of  the  resulting  N-terminal  cysteine. 
RGS16,  another  member  of  the  RGS  family,  was  also 
found  to  be  an  N-end  rule  substrate.  RGS4  that  was 
transiently  expressed  in  mouse  L  cells  was  short-lived  in 
these  cells.  However,  the  targeting  of  RGS4  for  degrada¬ 
tion  in  this  in  vivo  setting  involved  primarily  another 
degron,  because  N-terminal  variants  of  RGS4  that  were 
stable  in  reticulocyte  lysate  remained  unstable  in  L 
cells. 


A  multitude  of  regulatory  circuits,  including  those  that  con¬ 
trol  the  cell  cycle,  cell  differentiation,  and  responses  to  stress, 
involve  metabolically  unstable  proteins  (1-5).  A  short  in  vivo 
half-life  of  a  regulator  provides  a  way  to  generate  its  spatial 
gradients  and  allows  for  rapid  adjustments  of  its  concentration, 
or  submit  composition,  through  changes  in  the  rate  of  its 
3501  thesis  or  degradation.  Damaged  or  otherwise  abnormal  pro¬ 
teins  tend  to  be  short-lived  as  well  (6).  Features  of  proteins  that 
confer  metabolic  instability  are  called  degradation  signals  or 
degrons  (7,  8),  The  essential  component  of  one  degradation 
signal,  called  the  N-degron,  is  a  destabilizing  N-terminal  resi¬ 
due  of  a  protein  (9).  A  set  of  amino  acid  residues  that  are 
destabilizing  in  a  given  cell  yields  a  rule,  called  the  N-end  rule, 
which  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of 
its  N-terminal  residue.  The  N-end  rule  pathway  is  present  in 
all  organisms  examined,  from  mammals  and  plants  to  fungi 
and  prokaryotes  (10,  11). 
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In  eukaryotes,  an  N-degron  consists  of  two  determinants,  a 
destabilizing  N-terminal  residue  and  an  internal  lysine  or  ly¬ 
sines  (12-15).  The  Lys  residue  is  the  site  of  formation  of  a 
multiubiquitin  chain  (16).  The  N-end  rule  pathway  is  thus  one 
of  the  pathways  of  the  ubiquitin  (Ub)^  system.  Ub  is  a  76- 
residue  protein  whose  covalent  conjugation  to  other  proteins 
plays  a  role  in  a  vast  range  of  biological  processes  (4,  5,  17).  In 
most  of  them,  Ub  acts  through  routes  that  involve  the  degra¬ 
dation  of  ubiquitylated^  proteins  by  the  26  S  proteasome,  an 
ATP-dependent  multisubunit  protease  (18,  19). 

The  N-end  rule  has  a  hierarchic  structure.  In  the  yeast 
Saccharomyces  cerevisiae,  Asn  and  Gin  are  tertiary  destabiliz¬ 
ing  N-terminal  residues  in  that  they  function  through  their 
deamidation,  by  the  NTAl-encoded^  N-terminal  amidase  (Nt- 
amidase),  to  yield  the  secondary  destabilizing  residues  Asp  and 
Glu  (20).  The  destabilizing  activity  of  N-terminal  Asp  and  Glu 
requires  their  conjugation,  by  the  ATEl-encoded  Arg-tRNA- 
protein  transferase  (R- transferase),  to  Arg,  one  of  the  primary 
destabilizing  residues  (21,  22).  The  primary  destabilizing  N- 
terminal  residues  are  bound  directly  by  UBRl,  also  called 
N-recognin,  the  E3  (recognition)  component  of  the  N-end  rule 
pathway  (10,  11), 

In  mammals,  the  deamidation  step  is  mediated  by  two  Nt- 
amidases,  Nt^-amidase  and  Nt^-amidase,  which  are  specific, 
respectively,  for  N-terminal  Asn  and  Gin  (23,  24).  In  verte¬ 
brates,  the  set  of  secondary  destabilizing  residues  contains  not 
only  Asp  and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in 
yeast  (25,  26).  The  mammalian  counterpart  of  the  yeast  R- 
transferase  Atelp  exists  as  two  distinct  species,  ATE  1-1  and 
ATE  1-2,  that  are  produced  through  alternative  splicing  of  Ate  1 
pre-mRNA  (27).  Both  ATE  1-1  and  ATE  1-2  are  similar  in  spec¬ 
ificity  to  the  ATEi -encoded  yeast  R- transferase,  in  that  these 
R-transferases  can  arginylate  N-terminal  Asp  and  Glu,  but 
cannot  arginylate  N-terminal  Cys  (27),  suggesting  the  exist¬ 
ence  of  a  distinct  R-transferase  specific  for  N-terminal  Cys, 


^  The  abbreviations  used  are:  Ub,  ubiquitin;  Nt-araidase,  N-terminal 
amidase;  R-transferase,  Arg-tRNA-protein  transferase;  Ub-X-nsP4*, 
Ub-X-nsP4i_254;  GST,  glutathione  transferase;  RGS,  regulator  of  G- 
protein  signaling;  MetAP,  methionine  aminopeptidase;  GAP,  GTPase- 
activating  protein;  DUB,  deubiquitylating  enzyme;  AMP-PNP,  5'-ad- 
enylylimidodiphosphate;  CAPS,  3-(  cyclohexyl  amino)propanesulfonic 
acid;  PAGE,  polyacrylamide  gel  electrophoresis;  PCR,  polymerase  chain 
reaction;  ORF,  open  reading  frame. 

^  Ubiquitin  whose  C-terminal  (Gly-76)  carboxyl  group  is  covalently 
linked  to  another  compound  is  called  the  ubiquityl  moiety,  the  deriva¬ 
tive  terms  being  ubiquitylation  and  ubiquity lated.  The  term  Ub  refers 
to  both  free  ubiquitin  and  the  ubiquityl  moiety.  This  nomenclature  (5), 
which  is  also  recommended  by  the  Nomenclature  Committee  of  the 
International  Union  of  Biochemistry  and  Molecular  Biology  (74),  brings 
ubiquitin-related  terms  in  line  with  the  standard  chemical  terminology. 

^Throughout  the  text,  the  names  of  genes  are  italicized  and  all 
uppercase.  The  names  of  proteins  are  roman  and  all  uppercase.  This 
usage,  a  modification  of  the  existing  conventions  (73),  provides  a  uni¬ 
form  nomenclature  in  a  text  that  refers  to  both  fungal  and  metazoan 
genes  and  proteins. 
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UBRl  (N-recognin)  of  yeast  and  mammals  has  two  binding 
sites  for  the  primary  destabilizing  N-terminal  residues  of  ei¬ 
ther  proteins  or  short  peptides.  The  type  1  site  is  specific  for  the 
basic  N-terminal  residues  Arg,  Lys,  and  His.  The  type  2  site  is 
specific  for  the  bulky  hydrophobic  N-terminal  residues  Phe, 
Leu,  Trp,  Tyr,  and  He  (25,  28,  29).  UBRl  contains  yet  another 
substrate-binding  site,  which  targets  proteins  bearing  internal 
(non-N-terminal)  degrons.  These  proteins  include  CUP9  and 
GPAl  in  yeast  (30-32)  and  the  encephalomyocarditis  virus  3C 
protease  in  metazoans  (33). 

The  known  functions  of  the  N-end  rtile  pathway  include  the 
control  of  peptide  import  in  S.  cerevisiae^  through  the  degrada¬ 
tion  of  CUP9,  a  transcriptional  repressor  of  the  peptide  trans¬ 
porter  PTR2  (30)  (this  control  includes  a  positive  feedback 
mediated  by  the  type  1  and  type  2  sites  of  UBRl  (31));  the 
degradation  of  GPAl,  one  of  two  Ga  proteins  in  5.  cerevisiae 
(32);  and  the  degradation  of  alphaviral  RNA  pol5anerases  and 
other  viral  proteins  in  infected  metazoan  cells  (33,  34).  Physi¬ 
ological  N-end  rule  substrates  were  also  identified  among  the 
proteins  secreted  into  the  cytosol  of  the  mammalian  cell  by 
intracellular  parasites  such  as  the  bacterium  Listeria  monocy¬ 
togenes  (35).  Selective  perturbation  of  the  N-end  rule  pathway 
was  reported  to  interfere  with  mammalian  cell  differentiation 
(36, 37)  and  with  limb  regeneration  in  amphibians  (38).  Studies 
of  the  Ub-dependent  proteolysis  of  endogenous  proteins  in  mus¬ 
cle  extracts  suggested  that  the  N-end  rule  pathway  plays  a  role 
in  catabolic  states  that  result  in  muscle  atrophy  (39). 

Until  the  present  work,  physiological  substrates  of  Nt-ami- 
dases  and  R-transferases  were  unknown  in  either  yeast  or 
larger  eukaryotes.  Engineered  N-end  rule  substrates,  including 
the  substrates  of  Nt-amidases  and  R-transferases,  can  be  pro¬ 
duced  in  vivo  through  the  Ub  fusion  technique,  in  which  a 
Ub-X-reporter  fusion  is  cleaved,  cotranslationally,  after  the  last 
residue  of  Ub  by  deubiquitylating  enzymes  (DUBs)  (40),  yield¬ 
ing  a  reporter  protein  bearing  the  predetermined  N-terminal 
residue  X  (9,  10,  41). 

In  the  present  work,  we  employed  a  modification  of  the 
cDNA-based  sib-selection  strategy  in  a  transcription-transla¬ 
tion  lysate  from  rabbit  reticuloc3d:es  (42, 43)  to  identify  putative 
physiological  substrates  of  the  N-end  rule  pathway.  Specifi¬ 
cally,  we  used  dipeptides  bearing  destabilizing  N-terminal  res¬ 
idues  as  selective  inhibitors  of  the  N-end  rule  pathway,  and  we 
screened  for  mouse  cDNAs  that  expressed  proteins  whose  rel¬ 
ative  abundance  in  the  lysate  was  altered  in  the  presence  of 
relevant  dipeptides. 

Among  the  putative  N-end  rule  substrates  identified  through 
the  use  of  this  approach  was  mouse  RGS4,  a  GTPase-activating 
protein  (GAP)  for  specific  Ga  subunits  of  heterotrimeric  G 
proteins,  and  a  member  of  the  family  of  RGS  (regulator  of  G 
protein  signaling)  proteins  (44-50).  We  discovered  that  in  ad¬ 
dition  to  the  expected  removal  of  N-terminal  Met  from  the 
newly  formed  RGS4,  the  resulting  N-terminal  Cys  of  RGS4  was 
arginylated,  presumably  by  a  distinct  R-transferase  whose  Cys 
specificity  is  different  from  that  of  the  known  R-transferases. 
Thus  modified  RGS4  bore  N-terminal  Arg,  a  primary  destabi- 
hzing  residue,  and  was  degraded  by  the  N-end  rule  pathway  in 
reticulocyte  lysate. 

EXPERIMENTAL  PROCEDURES 

Plasmids — The  plasmids  pcDNA3-Ub-X-nsP4 1,254  (where  X  is  Met, 
Arg,  or  Tyr)  were  used  to  express  Ub-X-nsP4 1,254  fusions,  denoted  below 
as  Ub-X-nsP4*,  from  the  phage  T7  promoter  in  the  transcription-trans¬ 
lation  reticuloc3rte  lysate  system  by  Promega  (Madison,  WI).  The  nota¬ 
tion  nsP4i,254  refers  to  the  254-residue  N-terminal  fragment  of  the 
69-kDa  nsP4,  the  Sindbis  virus  RNA  polymerase  (34).  The 
pcDNA3-Ub-X-nsP4i_254  plasmids  were  constructed  using  a  set  of  open 
reading  frames  (ORFs)  encoding  fusions  between  Ub  and  the  full-length 
nsP4  (51).  The  Ub-X-nsP4  ORFs  in  pJCEXl  (a  gift  from  Dr.  T.  Ru- 


menapf,  Federal  Research  Center  for  Virus  Diseases  of  Animals,  Tub¬ 
ingen,  Germany)  were  used  as  templates  for  PGR  with  two  primers 
5'-TTCGGATCCGCCACCATGCAGATCTTCGTGAAGACCCTG-3'  and 
5'-CCTTCTAGACTATGCGGTGACAAACTCAGTGGTAAT-3',  to  pro¬ 
duce  DNA  fragments  encoding  Ub-X-nsP4 1,354  fusions  (the  underlined 
sequences  corresponded  to  the  start  and  stop  codons,  respectively). 
These  fragments  were  digested  with  BamUl  and  Xbal  and  cloned  into 
the  RamHI/X6aI-cut  pcDNAS  vector  (Invitrogen,  Carlsbad,  CA). 

The  plasmid  pcDNA3-Ub-Lys-mCL  expressed  Ub-Lys-mCL,  where 
Lys-mCL  was  an  N-terminally  truncated  large  subunit  of  m-calpain 
starting  with  Lys- 10.  The  primers  5'-ATTCCGCGGTGGCAAAGACC- 
GCGAGGCGGCCGAGGGGCTG-3'  and  5'-CCTACTAGTCTATCATAG- 
GACTGAAAAACTCACX)CACGAGAT-3'  were  used  to  amplify  a  DNA 
fragment  from  pET24-80k,  which  contained  cDNA  encoding  the  large 
subunit  of  rat  ra-calpain  (52)  (a  gift  from  Dr.  J.  S.  Elce,  Queen’s  Uni¬ 
versity,  Kingston,  Ontario,  Canada).  The  resulting  fragment  was  di¬ 
gested  with  SacII  and  Spel  and  cloned  into  5acII/SpcI-cut 
pcDNA3-Ub-Met-nsP4i,254.  The  plasmids  pcDNA3-p94  and 
pcDNA3-p94ci29A  were  used  to  express  the  mouse-specific  calpain  p94 
and  its  C129A  mutant.  To  produce  these  constructs,  plasmids  contain¬ 
ing  the  rat  p94  ORF  and  the  p94ci29A  ORF  (53)  (gifts  from  Dr.  H. 
Sorimachi  and  Dr.  K.  Suzuki,  University  of  Tokyo,  Japan)  were  PCR- 
amplified  and  cloned  into  the  pcDNA3  vector. 

The  plasmid  pcDNA-RGS4  was  used  for  expression  of  the  mouse 
RGS4  in  reticulocyte  lysate  and  for  transfection-mediated  expression  of 
RGS4  in  mouse  L  cells.  This  plasmid  contained  the  T7  and  CMV 
promoters,  and  the  mouse  RGS4  ORF  followed  by  two  stop  codons.  The 
RGS4  ORF  was  PCR-amplified  from  the  plasmid  pcDNA3-26-16-ll  (see 
below),  using  the  primers  5'-TTCGGATCCGCCACCATGTGCAAAGG- 
ACTTGCAGGTCTG-3'  and  5'-CCTTCTAGATCATTAGGCACACTGG- 
GAGACCAGGGA-3',  followed  by  digestion  with  BamKl  andX6aI  and 
cloning  into  RamHI/Xbal-cut  pcDNA3  (the  underlined  sequences  cor¬ 
responded  to  the  start  codon  and  two  stop  codons,  respectively).  The 
plasmids  pcDNAS-RGSlj^jg^,  pcDNA3-RGS4c2G»  pcDNA3-RGS4c2v? 
pcDNA3c2A»  PcDNA3-RGS4k3r,  pcDNA3-RGS4k3s,  and  pcDNA3- 
RGS4i9,205,  which  expressed  specific  RGS4  mutants,  were  constructed 
from  pcDNA3-RGS4  using  PCR-based  site-directed  mutagenesis  (54). 

The  plasmid  pcDNA3-RGS4j,ii9A-GST  expressed  RGS4j^i9^-GST,  a 
fusion  of  RGS4mi9a  and  glutathione  transferase  (GST).  It  was  con¬ 
structed  through  a  PCR-mediated  fusion  of  DNA  fragments  encoding 
RGS4  and  GST,  followed  by  cloning  into  RamHI/Xbal-cut  pcDNAS. 
PCR-mediated  site-directed  mutagenesis  was  then  used  to  introduce 
the  M19A  mutation.  The  plasmid  pcDNA3-RGS16,  expressing  mouse 
RGS16  from  the  T7  promoter  (55),  and  a  PCR-produced  fragment  en¬ 
coding  mouse  GPgL  were  gifts  from  Dr.  C.  K.  Chen  and  Dr.  M.  I.  Simon 
(California  Institute  of  Technology,  Pasadena,  CA).  The  entire  coding 
regions  of  the  final  plasmid  constructs  were  verified  by  DNA 
sequencing. 

In  Vitro  Transcription-Translation-Degradation  System — The  TNT 
Quick-coupled  Transcription-Translation  System  (Promega)  contained 
a  rabbit  reticulocyte  lysate  pre-mixed  with  most  of  the  reaction  compo¬ 
nents  necessary  to  carry  out  transcription/translation  in  the  lysate, 
including  all  of  the  amino  acids  except  methionine.  [^®S] Methionine 
(>1,000  Ci/mmol,  Amersham  Pharmacia  Biotech)  was  used  to  label 
newly  formed  proteins  in  the  lysate.  Proteins  labeled  with  ^H-amino- 
acids  were  produced  in  the  TNT  T7-coupled  Reticulocyte  Lysate  (Pro¬ 
mega),  a  version  of  the  system  where  the  main  components  of  the 
reaction  were  supplied  separately.  The  reactions  were  set  up  according 
to  the  manufacturer’s  instructions.  The  reaction  mixtures  containing 
dipeptides  also  contained  0.15  mM  bestatin  (Sigma),  an  inhibitor  of 
some  aminopeptidases,  to  reduce  degradation  of  the  added  dipeptides 
(28).  Stock  samples  of  dipeptides  were  0,5  M  solutions  in  10  mM  K- 
HEPES,  pH  7.5.  The  reactions  were  incubated  at  30  ®C,  unless  stated 
otherwise;  they  were  terminated  by  the  addition  of  SDS-saraple  buffer, 
heated  at  95  ®C  for  5  min,  and  fractionated  by  SDS-PAGE,  followed  by 
autoradiography  or  fluorography.  ^®S  in  protein  bands  was  determined 
using  Phosphorlmager  (Molecular  Dynamics,  Sunn5rvale,  CA). 

Screening  of  Mouse  cDNA  Library  of  Small  Pools  in  Rabbit  Reticu¬ 
locyte  Lysate — Total  RNA  was  isolated  from  the  brains  of  female  mice 
with  the  guanidine  thiocyanate  method  (54).  Poly(A)'’’  RNA  was  puri¬ 
fied  using  Oligotex  mRNA  kit  (Qiagen,  Valencia,  CA)  and  was  used  to 
synthesize  oligo(dT)-primed  double-stranded  cDNA  with  the  Super- 
Script  cDNA  synthesis  kit  (Life  Technologies,  Inc.).  After  ligating  the 
DNA  fragments  to  a  BstXL  adapter  (Invitrogen,  San  Diego)  and  digest¬ 
ing  with  No/I,  the  cDNAs  were  cloned  into  Bs/XI-No/I-cut  pcDNA3.  The 
resulting  cDNA  library  was  introduced,  by  electroporation,  into  Esche¬ 
richia  coli  SURE  cells  (Stratagene,  La  Jolla,  CA),  and  the  transfor¬ 
mants  were  frozen  in  a  number  of  samples.  Titration  of  the  library 
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showed  that  it  contained  3-4  X  10®  independent  clones.  Pools  of  the 
library  cDNAs  containing  each  about  50  clones  were  prepared  as  de¬ 
scribed  (42).  Individual  pools  were  added  to  the  T7  TNT  Quick-coupled 
Transcription-Translation  System  (Promega)  together  with  [®®S]  methi¬ 
onine,  in  either  the  presence  or  the  absence  of  the  mixture  of  5  raM 
Arg-/3-Ala,  5  Trp-Ala  and  0.15  mM  bestatin.  All  components  of  a 
reaction  except  the  lysate  were  gently  mixed  together,  followed  by  the 
addition  of  lysate.  The  reactions  were  performed  in  the  total  volume  of 
6.25  /il  in  a  96-well  microtiter  plate  at  30  ®C  for  3.5  h.  These  conditions 
were  chosen  after  preliminary  optimization,  using  iyr-nsP4*  as  a  test 
protein.  Once  a  cDNA  pool  was  found  to  express  a  protein  whose 
relative  abundance  was  increased  in  the  presence  of  the  dipeptides,  the 
pool  was  progressively  subdivided  and  retested,  until  the  isolation  of  a 
single  positive  cDNA  clone.  One  of  the  clones  thus  isolated,  termed 
pcDNA3-26-16-ll,  contained  as  ~3-kilobase  pair  mouse  RGS4  cDNA 
(see  “Results”). 

N-terminal  Radiochemical  Sequencing — RGS4j^i9A"^ST  was  ex¬ 
pressed  in  the  TNT  T7-coupled  Reticulocyte  Lysate  System  (Promega) 
in  the  presence  of  either  [^H] arginine  (49  Ci/mmol),  PH] leucine  (155 
Ci/mmol),  or  pH]lysine  (87  Ci/mmol)  (Amersham  Pharmacia  Biotech),  1 
mM  Arg-^-Ala,  and  0.15  mM  bestatin  for  30  min  at  30  ®C,  in  the  total 
volume  of  0.4  ml.  The  reaction  was  stopped  by  the  addition  of  20  mM 
AMP-PNP  (Sigma),  a  non-hydrolyzable  ATP  analog,  and  the  labeled 
RGS4mi9a’GST  was  partially  purified  by  affinity  chromatography  on 
glutathione-Sepharose  4B  (Amersham  Pharmacia  Biotech),  followed  by 
SDS-10%  PAGE  and  the  electrophoretic  transfer  of  separated  proteins 
in  10%  methanol,  10  mM  CAPS,  pH  11,  to  Immobiion-P  membrane 
(Millipore,  Burlington,  MA).  The  transferred  band  of  RGS4mi9a’GST 
was  cut  out  (its  position  was  determined  with  respect  to  stained  protein 
markers)  and  subjected  to  multiple  cycles  of  Edman  degradation,  using 
the  47 6A  Applied  Biosystems  sequencer  (Perkin-Elmer).  The  fractions 
from  each  cycle  were  collected,  and  in  the  fractions  was  determined 
using  a  scintillation  counter. 

Transfections  of  Mouse  L  Cells  and  Pulse-Chase  Assay — Transient 
transfections  of  mouse  L  cells  and  pulse-chase  analysis  were  performed 
as  described  previously  (56).  The  affinity-purified  polyclonal  antibody 
raised  against  a  peptide  near  the  C  terminus  of  RGS4  was  purchased 
from  Santa  Cruz  Biotechnology  (Santa  Cruz,  CA).  In  the  experiments 
with  proteasome  inhibitor  MG132  (Calbiochem),  it  was  added  to  cells, 
at  the  final  concentration  of  50  /xM,  45  min  before  the  addition  of 
[^®S]  methionine  and  was  present  throughout  the  pulse-chase  assay. 

RESULTS 

Degradation  of  N -end  Rule  Substrates  in  a  Transcription- 
Translation  System  Derived  from  Reticulocytes — The  N-end 
rule  pathway  can  be  selectively  inhibited  in  reticulocyte  lysates 
through  the  addition  of  dipeptides  bearing  either  type  1  or  type 
2  destabilizing  N-terminal  residues  (25,  28).  A  commercially 
produced  transcription-translation  system  derived  from  reticu¬ 
locyte  lysate  and  containing  the  phage  T7  RNA  polymerase  was 
used  to  express  a  protein  of  interest  from  DNA  template  in  one 
reaction  mixture.  A  putative  N-end  rule  substrate  in  the  lysate 
was  detected  through  its  increased  concentration  in  the  pres¬ 
ence  of  a  cognate  dipeptide  inhibitor  of  the  N-end  rule  pathway. 
We  began  by  examining  the  degradation  of  model  N-end  rule 
substrates,  which  were  derived  from  nsP4,  the  69-kDa  Sindbis 
virus  RNA  polymerase,  a  physiological  substrate  of  the  mam¬ 
malian  N-end  rule  pathway  that  bears  N-terminal  Tyr  (34). 
The  nsP4  protein  is  produced  during  Sindbis  virus  infection 
through  site-specific  cleavage  of  the  viral  polyprotein  precur¬ 
sor.  In  the  present  work,  the  Ub  fusion  technique  (9,  10,  51) 
was  used  to  synthesize,  in  reticuloc5d:e  lysate,  a  set  of  X-nsP4 
derivatives  that  contained  the  first  254  residues  of  nsP4  and  in 
addition  bore  different  residues  at  the  Ub-nsP4  junction.  These 
test  proteins,  Ub-Arg-nsP4i_254,  Ub-T3nr-nsP4i_254,  and  Ub- 
Met-nsP4i_254,  are  denoted  below  as  Ub-Arg-nsP4*,  Ub-Tyr- 
nsP4*,  and  Ub-Met-nsP4*,  respectively.  Their  N-terminal  Ub 
moiety  was  cotranslationally  cleaved  off  by  DUBs  in  the  lysate 
yielding  Tyr-nsP4*,  Arg-nsP4*,  and  Met-nsP4*,  respectively. 
Earlier  experiments  by  T.  Riimenapf  have  shown  that  these 


constructs  were  N-end  rule  substrates  in  the  transcription- 
translation  reticulocyte  lysate.^ 

When  a  Ub-X-nsP4*  fusion  was  expressed  in  reticulocyte 
lysate  and  monitored  as  a  function  of  time,  a  major  band 
corresponding  to  Ub-lacking  X-nsP4*  was  observed  (Fig.  1,  A 
and  C).  Although  the  electrophoretic  mobility  of  X-nsP4*  pro¬ 
teins  was  slightly  higher  than  expected  from  their  predicted 
molecular  mass  of  29  kDa,  the  observed  proteins  were  clearly 
X-nsP4*,  rather  than,  for  example,  Ub-X-nsP4*,  because  the 
removal  of  Ub  occurs  cotranslationally  or  nearly  so  (57),  and 
also  because  SDS-PAGE  of  the  same  samples  using  a  more 
concentrated  polyacrylamide  gel  revealed  the  --S-kDa  band  of 
labeled  free  Ub  (data  not  shown).  In  addition,  the  degradation 
patterns  of  presumed  X-nsP4*  derivatives  of  Ub-X-nsP4*  fu¬ 
sions  conformed  to  the  N-end  rule,  as  shown  below. 

The  metabolically  stable  Met-nsP4*  accumulated  rapidly 
during  the  first  30  min  of  the  transcription-translation  reaction 
and  reached  a  plateau  around  30  min,  because  the  protein 
synthesis  (but  not  the  activity  of  the  N-end  rule  pathway;  see 
below)  ceased  in  the  lysate  by  that  time  (Fig.  1,  C  and  D).  Thus, 
in  this  setting,  the  incubation  times  in  excess  of  —30  min  were 
operationally  equivalent  to  the  “chase”  part  of  a  pulse-chase 
experiment.  In  contrast  to  Met-nsP4*,  the  relative  levels  of 
Arg-nsP4*  (Fig.  1,  A  and  B,  lanes  labeled  c  (controls))  and 
TVr-nsP4*  (data  not  shown)  began  to  decrease  after  30  min, 
reflecting  their  continuing  degradation  by  the  N-end  rule  path¬ 
way.  Arg  and  Tyr  are,  respectively,  a  type  1  (basic)  and  a  type 
2  (bulky  hydrophobic)  destabilizing  residue  in  the  N-end  rule 
(10).  A  ladder  of  higher  molecular  mass  bands,  apparently  of 
multiubiquitylated  Arg-nsP4*,  was  observed  with  Arg-nsP4* 
(Fig.  lA)  but  not  with  the  metabolically  stable  Met-nsP4*  (data 
not  shown). 

The  addition  of  Arg-)3-Ala,  a  type  1  inhibitor  of  the  N-end 
rule  pathway  (25),  strongly  inhibited  the  ubiquitylation  and 
degradation  of  Arg-nsP4*  (Fig.  lA,  lanes  labeled  1).  The  same 
dipeptide  had  no  effect  on  the  degradation  of  TyT-nsP4*,  which 
bore  a  type  2  destabilizing  N-terminal  residue  (data  not 
shown).  Conversely,  Trp-Ala,  a  type  2  inhibitor,  strongly  de¬ 
creased  the  degradation  of  Tyr-nsP4*  (Fig.  25)  but  had  little 
effect  on  the  degradation  of  Arg-nsP4*  (Fig.  lA,  lanes  labeled 
2).  At  1  mM,  none  of  the  several  dipeptides  tested,  including 
those  bearing  type  1,  type  2,  or  type  3  destabilizing  N-terminal 
residues  (25),  affected  the  relative  band  intensity  of  Met-nsP4* 
(Fig.  1,  C  and  D),  indicating  that  these  dipeptides  did  not 
perturb  transcription  and  translation  in  this  system.  However, 
at  10  mM,  some  of  the  dipeptides  significantly  delayed  the 
s3mthesis  of  X-nsP4*  (data  not  shown).  Therefore  the  experi¬ 
ments  were  carried  out  with  dipeptides  at  the  initial  concen¬ 
tration  of  1  mM.  At  this  concentration,  the  inhibition  of  the 
N-end  rule  pathway  was  strong  but  incomplete.  The  inhibition 
of  degradation  of,  for  example,  a  type  1  substrate  such  as 
Arg-nsP4*  by  a  cognate  dipeptide  could  be  observed  through 
large  differences  in  the  amount  of  a  test  protein  produced  by  30 
min,  the  time  of  cessation  of  protein  synthesis  in  the  lysate,  and 
through  even  larger  differences  at  60  min  (Fig.  IB), 

m-Calpain  Bearing  N-terminal  Lysine  Is  Not  a  Substrate  of 
the  N-End  Rule  Pathway — m-Calpain  is  a  calcium-activated 
cysteine  protease  composed  of  an  80-kDa  large  subunit  (mCL) 
and  a  30-kDa  small  subunit  (mCS)  (58).  The  m-  and  p,-calpains 
are  ubiquitously  expressed  in  metazoan  cells.  In  the  presence  of 
Ca^"^,  calpain  subunits  undergo  autoproteolytic  cleavages  in 
their  N-terminal  regions.  The  processed  calpains  have  in¬ 
creased  activity.  The  in  vitro  autoproteolytic  cleavage  of  mCL 
was  shown  to  remove  the  first  9  residues  of  mCL,  yielding  a 
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Fig.  1.  Effects  of  dipeptides  on  the  accumulation  of  Arg“nsP4*  and  Met-nsP4*  in  the  transcription-translation  reticulocyte  lysate. 

Plasmids  expressing  Ub-Arg-nsP4*  (A)  or  Ub-Met-nsP4*  (C)  were  added  to  reticulocyte  lysate  in  the  presence  of  methionine  and  either  in  the 
absence  of  N-end  rule  inhibitors  {lane  c  (controls))  or  in  the  presence  of  either  1  mM  Arg-j3-Ala  {lanes  i),  or  1  mM  Trp-Ala  {lanes  2),  or  1  mM  Ala-Lys 
{lanes  5),  or  1  mM  Lys-Ala  {lanes  4),  or  1  mM  Ala-Arg  {lanes  5).  The  samples  in  lanes  1—5  also  contained  0.15  mM  bestatin.  The  reactions  were  carried 
out  for  10,  20,  30,  60,  120,  and  240  min,  followed  by  SDS-PAGE,  autoradiography,  and  quantitation.  The  position  of  R-nsP4*  band  and  the 
multi-Ub-containing  species  are  indicated  on  the  left.  The  positions  of  molecular  mass  markers  are  indicated  on  the  right.  B,  relative  amounts  of 
^®S-labeled  Arg-nsP4*  in  A  were  determined  using  Phosphorlmager  and  were  plotted  as  a  function  of  incubation  time.  The  amount  of  Arg-nsP4* 
in  the  reaction  without  dipeptides  at  30  min  was  assigned  the  value  of  100.  Control  reaction  are  as  follows:  in  the  absence  of  dipeptides  (□);  in  the 
presence  of  Arg-^-Ala  ( 0 );  in  the  presence  of  Trp-Ala  (A);  in  the  presence  of  Ala-Lys  (■);  in  the  presence  of  Lys-Ala  ( ♦ );  and  in  the  presence  of 
Ala-Arg  (A).  D,  relative  amounts  of  [^®S]Met-nsP4*  in  C  were  determined  and  plotted  as  in  B. 


modified  subunit,  denoted  below  as  Lys-mCL,  which  bears  N- 
terminal  Lys  (59),  a  type  1  destabilizing  residue  (10).  We  asked 
whether  Lys-mCL  was  unstable  in  reticulocyte  lysate  and,  if  so, 
whether  its  instabiHty  required  N-terminal  Lys  residue.  A  plas¬ 
mid  encoding  the  Ub-Lys-mCL  fusion  was  constructed,  and  the 
fusion  was  expressed  in  the  transcription-translation  lysate.  It 
was  found  that  under  standard  conditions  (in  the  absence  of 
added  Ca^^),  the  fusion-derived  Lys-mCL  was  metabolically 
stable  in  the  lysate  (Fig.  2A).  In  the  presence  of  added  Ca^*^  (2 
mM),  the  newly  formed  Lys-mCL  underwent  rapid  (apparently 
autolytic)  degradation  in  the  lysate.  Significantly,  this  Ca^"^- 
induced  degradation  of  Lys-mCL  was  not  inhibited  by  Arg-j3- 
Ala,  a  type  1  inhibitor  of  the  N-end  rule  pathway  (Fig.  2).  Thus, 
the  Lys-mCL  derivative  of  the  large  subunit  of  m-calpain  is  not 
a  substrate  of  the  N-end  rule  pathway  in  reticulocyte  lysate, 
despite  the  presence  of  a  destabilizing  N-terminal  residue  (see 
“Discussion”). 

The  muscle-specific  calpain  p94  is  homologous  to  the  ubiqui¬ 
tous  calpains  and  was  shown  to  be  metabolically  unstable  (53, 
60).  We  expressed  rat  p94  and  its  proteolytically  inactive  mu¬ 
tant  p94ci29A  reticuloc3d;e  lysate.  The  full-length  p94  was 
rapidly  cleaved  in  the  lysate,  yielding  several  fragments, 
whereas  proteolytically  inactive  p94ci29A  remained  stable 
(data  not  shown),  in  agreement  with  the  earlier  evidence  (53). 
Dipeptides  Arg-/3‘Ala  and  Trp-Ala  had  no  effect  on  the  relative 
yields  of  bands  corresponding  to  either  the  full-length  p94  or 
the  products  of  its  autolysis  (data  not  shown).  Thus,  neither 
p94  nor  its  autol5t;ically  produced  fragments  are  the  substrates 
of  the  N-end  rule  pathway. 


Degradation  of  RGS4  in  Reticulocyte  Lysate  Is  Decreased  by 
Type  1  Inhibitors  of  the  N-end  Rule  Pathway  and  by  a  Protea- 
some  Inhibitor — To  search  for  putative  N-end  rule  substrates 
in  a  systematic  way,  we  employed  a  modification  of  the  previ¬ 
ously  developed  in  vitro  screening  method  that  involves  the  use 
of  small  pools  of  cDNA  clones  (42,  43).  A  mouse  brain  cDNA 
library  was  constructed,  and  about  500  mouse  cDNA  pools, 
each  containing  about  50  cDNAs,  were  tested,  sequentially,  in 
the  transcription-translation  reticulocyte  lysate.  Each  reaction 
was  carried  out  in  either  the  absence  or  the  presence  of  a 
mixture  of  two  dipeptides,  Arg-/3-Ala  and  Trp-Ala,  which  have 
been  shown  to  inhibit  selectively  the  degradation  of  engineered 
N-end  rule  substrates  in  the  lysate  (Figs.  1,  A  and  B,  and  2B). 
The  resulting  ^®S-labeled  proteins  were  visualized  by  SDS- 
PAGE  and  autoradiography.  We  searched  for  protein  bands 
that  were  selectively  enhanced  in  the  presence,  but  not  in  the 
absence,  of  the  dipeptide  inhibitors.  A  cDNA  pool  containing  a 
putative  substrate  of  the  N-end  rule  pathway  was  subjected  to 
subcloning,  followed  by  re-analysis  in  the  lysate.  This  proce¬ 
dure  yielded  specific  cDNAs  encoding  putative  N-end  rule 
substrates. 

Thus  far,  we  found  7  cDNAs  that  encoded  proteins  whose 
expression  patterns  identified  them  as  putative  substrates  of 
the  N-end  rule  pathway.  Most  of  these  cDNAs  encoded  N- 
terminally  truncated  polypeptides,  produced  by  translation 
from  Met  codons  that  were  internal  in  the  corresponding  wild 
type  ORFs  (data  not  shown).  Although  some  of  these  truncated 
proteins  may  prove  to  be  physiologically  relevant  substrates  of 
the  N-end  rule  pathway,  we  focused  at  first  on  a  putative  N-end 
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Pig.  2.  Lys-mCL  is  not  an  N-end  rule  substrate  in  reticulocyte 
lysate.  A,  plasmid  DNA  encoding  a  Ub-Lys-mCL,  a  Ub  fusion  to  a 
derivative  of  the  large  m-calpain  subunit  bearing  N-terminal  Lys  (see 
“Results”),  was  expressed  in  the  transcription-translation  reticulocyte 
lysate  in  the  presence  of  methionine.  The  reaction  was  allowed  to 
proceed  for  20  min  at  37  “C  (“pulse”).  Thereafter,  cycloheximide  and 
unlabeled  L-raethionine  were  added  to  all  samples,  to  the  final  concen¬ 
trations  of  0.1  mg/ml  and  0.5  mM,  respectively.  CaCl2  and/or  Arg-^-Ala 
were  added  to  some  of  the  samples,  to  the  final  concentrations  of  2  and 
2.5  mM,  respectively.  0.15  mM  bestatin  was  present  in  these  samples  as 
well.  TTie  reactions  mixtures  were  incubated  for  the  indicated  chase 
times  of  0,  1,  or  2  h,  followed  by  SDS-PAGE  and  autoradiography.  B, 
plasmid  DNA  encoding  Ub-Tyr-nsP4*  was  expressed  in  the  lysate,  and 
the  reactions  were  allowed  to  proceed  as  in  A  except  that  Trp-Ala  was 
added,  instead  of  Arg-p-Ala,  after  the  pulse.  The  positions  of  (deubiq- 
uitylated)  Lys-mCL  and  Tyr-nsP4*  bands  are  as  indicated  at  the  right. 


rule  substrate  encoded  by  a  full-length  cDNA,  which  was  found 
to  encode  the  mouse  RGS4,  a  member  of  the  RGS  family  of 
GAPs  for  specific  Ga  subimits  of  G  proteins  (44,  45).  The 
~3-kilobase  pair  RGS4  mRNA  contained  the  --2-kilobase  pair 
long  3'-imtranslated  region.  To  optimize  the  expression  of 
RGS4  in  reticuloc)de  lysate,  we  constructed  the  plasmid 
pcDNA3-RGS4,  which  contained  the  full-length  RGS4  ORF, 
followed  by  two  stop  codons,  and  lacked  the  wild  type  3'- 
untranslated  region  of  RGS4  mRNA.  As  with  the  original 
RGS4  cDNA  isolate  (data  not  shown),  the  transcription-trans¬ 
lation  of  pcDNA3-RGS4  3delded  two  protein  bands,  of  25  and  22 
kDa  (Fig.  3,  A  and  D).  The  larger  protein  was  the  full-length 
RGS4,  whereas  the  smaller  one  was  produced  by  translation 
that  started  from  a  downstream  Met  (AUG)  codon  at  amino 
acid  position  19,  because  pcDNA3-RGS4iyii9^,  in  which  the 
Met- 19  codon  was  converted  to  an  Ala  codon,  did  not  express 
the  lower  protein  band  (Fig.  SD). 

The  RGS4  cDNA  was  initially  picked  by  this  screen  because 
upon  the  incubation  times  of  2-4  h,  the  lower  (22-kDa)  band 
increased  in  intensity  in  the  presence,  but  not  in  the  absence,  of 
dipeptide  inhibitors.  However,  this  effect  was  later  found  to  be 
unrelated  to  the  N-end  rule  pathway,  as  it  was  observed  in  the 
presence  of  all  dipeptides  tested,  including  those  that  did  not 
inhibit  the  pathway  (Fig.  3A).  In  contrast,  at  the  incubation 
times  from  20  min  to  1  h,  the  relative  amount  of  the  full-length 
RGS4  increased  up  to  5-fold  in  the  presence  of  either  Arg-j3-Ala 
or  Lys-Ala,  the  type  1  inhibitors  of  the  N-end  rule  pathway 
(Fig.  3,  A  and  C),  but  not  in  the  presence  of  Trp-Ala  (type  2 
inhibitor  of  the  same  pathway),  or  Ala-Lys  and  Ala-Arg  (type  3 
inhibitors)  RGS4  (Fig.  4).  A  ladder  of  high  molecular  mass 
bands,  apparently  of  multiubiquitylated  RGS4,  was  observed 
above  the  RGS4  and  RGS4j^3^9;^  bands  at  10,  20,  and  30  min  of 
incubation  (Fig.  3,  A  and  B  and  data  not  shown).  Arg-j3-Ala  and 
Lys-Ala,  but  not  the  other  tested  dipeptides,  delayed  the  ap¬ 
pearance  of  multiubiquitylated  RGS4  (Fig.  3,  A  and  B).  The 


temporal  pattern  of  inhibition  of  degradation  of  RGS4  by  dipep¬ 
tide  inhibitors  was  similar  to  that  described  (and  explained) 
above  for  the  engineered  N-end  rule  substrate  Arg-nsP4*  (Fig. 
1,  A  and  B).  Taken  together,  these  data  suggested  that  RGS4 
was  a  type  1  substrate  of  the  N-end  rule  pathway  in  reticulo¬ 
cyte  lysate. 

At  longer  reaction  times  (either  2  or  4  h)  the  effect  of  type  1 
N-end  rule  inhibitors  on  the  relative  level  of  RGS4  was  not 
observed  (Fig.  3,  A  and  C,  and  Fig.  4).  By  contrast,  the  band  of 
a  model  N-end  rule  substrate  such  as  Arg-nsP4*  was  stronger 
in  the  presence  of  Arg-j3-Ala  inhibitor  at  all  reaction  times, 
including  2  and  4  h  (Fig.  IB  and  Fig.  4).  This  difference  in  the 
patterns  of  inhibition  of  RGS4  and  Arg-nsP4*  degradation 
could  be  explained  if,  in  contrast  to  Arg-nsP4*,  there  were  two 
pools  of  the  newly  formed  RGS4  molecules,  one  of  which  un¬ 
derwent  rapid  degradation,  while  the  other  remained  long- 
lived  and  accumulated  during  the  chase,  after  —30  min  of 
incubation.  The  fraction  of  short-lived  RGS4  would  be  strongly 
increased  by  the  type  1  inhibitors  Arg-j3-Ala  or  Lys-Ala  at  the 
early  times  of  incubation  (before  -^1  h),  but  over  longer  times 
(from  1  to  4  h)  this  degradation-susceptible  form  of  RGS4  would 
still  be  degraded  in  the  lysate,  presximably  because  of  incom¬ 
plete  inhibition  of  the  N-end  rule  pathway  by  dipeptides,  in 
addition  to  gradual  destruction  of  the  dipeptides  by  peptidases 
in  the  lysate.  Thus,  at  reaction  times  of  2  h  or  longer,  only  the 
subset  of  long-lived  RGS4  molecules  remained. 

The  fraction  of  short-lived  RGS4  comprised  75—80%  of  the 
total  RGS4;  the  remainder  of  RGS4  molecules  (15-25%)  was 
long-lived.  This  estimate  stemmed  from  the  fact  that  at  30  min, 
the  amount  of  RGS4  was  4  to  5  times  higher  in  the  presence  of 
Arg-j3-Ala  than  in  its  absence  (Fig.  3C,  Fig.  4,  and  data  not 
shown).  (That  the  effect  of  Arg-)3-Ala  did  not  result  from  its 
influence  on  the  rates  of  transcription/translation  in  the  lysate 
was  indicated  by  control  experiments  with  the  long-lived  Met- 
nsP4*  (Fig.  1,  C  and  D).)  In  contrast  to  RGS4,  most  Arg-nsP4* 
molecules  were  short-lived  in  the  lysate.  As  a  result,  the  rela¬ 
tive  amoimt  of  Arg-nsP4*  was  higher  in  the  presence  of  Arg-p- 
Ala  than  in  its  absence  throughout  the  4-h  incubation  (Fig,  4), 
although  the  amount  of  Arg-nsP4*  continued  to  decline  after 
1  h  even  in  the  presence  of  Arg-/3-Ala  (Fig.  1),  for  the  reasons 
described  above. 

We  also  examined  the  effect  of  a  proteasome  inhibitor, 
MG115  (61),  on  the  degradation  of  RGS4  and  Arg-nsP4*  in  the 
lysate.  MG115  at  10  /ulm  markedly  increased  the  levels  of  both 
RGS4  and  Arg-nsP4*  at  the  incubation  times  shorter  than  1  h 
(Fig.  5,  A-D).  MG115  at  0.1  mM  further  increased  the  amounts 
of  RGS4  and  Arg-nsP4*  at  these  incubation  times  (Fig.  5,  A-D). 
In  addition,  the  ladders  of  apparently  multiubiquitylated  RGS4 
and  Arg-nsP4*  were  enhanced  in  the  presence  of  MG115  (Fig. 
5,  A  and  C),  in  contrast  to  the  effect  of  N-end  rule  inhibitors, 
which  decreased  the  relative  level  of  multi-Ub  ladders  (Fig. 
3B).  These  patterns  were  consistent  with  MG115  inhibiting 
post-ubiquitylation  steps  of  the  proteasome-mediated  degrada¬ 
tion  of  RGS4  and  Arg-nsP4*,  in  contrast  to  the  action  of  dipep¬ 
tides,  which  inhibited  pre-ubiquitylation  steps  (25,  61). 

Similarly  to  the  effects  of  dipeptide  inhibitors  (Fig.  1),  the 
proteasome  level  inhibition  by  MG115  was  also  incomplete  and 
resulted  in  a  gradual  decrease  of  the  RGS4  and  Arg-nsP4* 
levels  after  30  min  of  incubation  (Fig.  5).  Note  that  this  de¬ 
crease  led  to  the  nearly  complete  disappearance  of  Arg-nsP4* 
by  4  h  of  incubation,  whereas  in  the  case  of  RGS4  the  decrease 
stopped  when  20-25%  of  RGS4  still  remained  in  the  lysate 
(Fig.  5,  B  and  D).  These  data  provided  independent  evidence  for 
the  existence  of  a  degradation-resistant  fraction  of  RGS4.  In 
summary,  we  identified  mouse  RGS4  as  a  type  1  substrate  of 
the  N-end  rule  pathway  in  the  reticulocyte  lysate. 
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Fig.  3.  Type  1  dipeptides  Arg-^-Ala  and  Lys-Ala  inhibit  degradation  and  ubiquitylation  of  RGS4  in  reticuloc}^  lysate.  A,  the 
plasmid  pcI)NA3-RGS4  (see  “Experimental  Procedures”)  was  expressed  in  the  transcription -translation  reticulocyte  lysate  in  the  presence  of 
[^®S]  methionine  and  either  in  the  absence  of  dipeptides  (lanes  c)  or  in  the  presence  of  1  mM  Arg-)3-Ala  (lane  i),  1  mM  Trp-Ala  (lane  2),  1  mM  Ala-Lys 
(lane  3),  1  mM  Lys-Ala  (lane  4),  and  1  mM  Ala-Arg  (lane  5).  Dipeptide-containing  samples  also  contained  0.15  mM  bestatin.  The  reactions  were 
carried  out  for  10,  20,  30,  60,  120,  and  240  min,  followed  by  SDS-PAGE,  autoradiography,  and  quantitation.  The  positions  of  mouse  RGS4  and  the 
multi-Ub-containing  species  are  indicated  on  the  left.  An  asterisk  marks  the  N-terminally  truncated  RGS4  derivatives,  which  were  produced 
through  alternative  translational  initiation  from  the  internal  start  codon  at  the  Met-19  position  of  RGS4.  The  positions  of  molecular  mass  markers 
are  indicated  on  the  right.  B,  autoradiographic  overexposure  of  the  20-rain  part  of  B  to  visualize  the  ladders  of  multi -Ub-containing  species.  C, 
relative  amounts  of  ^®S-RGS4  in  A  were  determined  using  Phosphorlmager  and  were  plotted  as  a  function  of  incubation  time.  The  amount  of  RGS4 
in  the  reaction  without  dipeptides  at  30  min  was  assigned  the  value  of  100.  Control  reactions  were  as  follows:  in  the  absence  of  dipeptides  ([H);  in 
the  presence  of  Arg-/3-Ala  (0);  in  the  presence  of  Trp-Ala  (A);  in  the  presence  of  Ala-Lys  (■);  in  the  presence  of  Lys-Ala  (♦);  and  in  the  presence 
of  Ala-Arg  (A).  D,  plasmids  encoding  either  wild  type  RGS4  (wt),  or  RGS4mi9a»  or  RGS4i9_205  were  expressed  in  the  lysate  either  in  the  absence 
of  dipeptides  (lane  c),  or  in  the  presence  of  1  mM  Arg-^-Ala  and  0.15  mM  bestatin  (lane  i),  or  in  the  presence  of  1  mM  Trp-Ala  and  0.15  mM  bestatin 
(lane  2). 


The  Degradation  Signal  of  RGS4  Is  a  Cysteine-based  N- 
degron — The  N-terminally  truncated  RGS4,  produced  through 
translation  initiation  at  the  second  (Met-19)  start  codon  of  the 
RGS4  ORF  (see  above),  was  not  a  substrate  of  the  N-end  rule 
pathway,  because  degradation  of  this  RGS4ig_205  derivative  in 
the  lysate  was  not  selectively  decreased  by  dipeptide  inhibitors 
of  the  N-end  rule  pathway  (Fig.  3A  and  data  not  shown).  This 
finding  strongly  suggested  that  the  UBRl-dependent  degron  of 
the  full-length  RGS4  was  located  near  its  N  terminus.  The 
deduced  N-terminal  sequence  of  RGS4  is  Met-Cys-Lys-Gly- 
(62),  Extensive  evidence  indicates  that  c3d:osolic  methionine 
aminopeptidases  (MetAPs),  a  class  of  proteases  present  in  all 
organisms,  cleave  off  N-terminal  Met  from  proteins  and  short 
peptides  depending  largely  on  the  identity  of  the  next  residue, 
which  becomes  N-terminal  after  the  cleavage.  In  particular, 
MetAPs  rapidly  (cotranslationally)  cleave  Met  off  N-terminal 
sequences  beginning  with  Met-Cys  (63-65).  Cys  is  a  stabilizing 
residue  in  prokaryotes  and  in  the  yeast  S.  cerevisiae  but  a 
secondary  destabilizing  residue  in  the  metazoan  N-end  rule 
pathway  (25,  26).  Specifically,  the  destabilizing  activity  of  N- 
terminal  Cys  is  tRNA-dependent,  strongly  suggesting  that  an 
Arg-tRNA-protein  transferase  (R-transferase)  conjugates  Arg, 
a  primary  destabilizing  residue,  to  the  N-terminal  Cys  of  an 
N-end  rule  substrate  (25).  This,  and  the  fact  that  the  degrada¬ 
tion  of  RGS4  was  inhibited  by  type  1  but  not  by  t3q)e  2  dipep¬ 
tides  (Fig.  3),  suggested  the  following  model:  the  N-terminal 


Met  of  newly  formed  RGS4  is  removed  by  MetAPs;  the  result¬ 
ing  N-terminal  Cys  is  arginylated  by  R-transferase,  and  the 
arginylated  RGS4  is  targeted  for  degradation  by  the  UBRl- 
encoded  E3a:  and  the  rest  of  the  N-end  rule  pathway. 

One  prediction  of  this  model  was  that  a  significant  fraction  of 
RGS4  in  the  reticulocyte  lysate  should  contain  the  N-terminal 
sequence  Arg-Cys-Lys-Gly-,  as  distinguished  from  either  Cys- 
Lys-Gly-  or  the  encoded  sequence  Met-Cys-Lys-Gly-.  Another 
prediction  was  that  a  mutational  replacement  of  Cys-2  with  a 
stabilizing  residue  that  still  allows  the  efficient  removal  of 
N-terminal  Met  should  make  the  resulting  RGS4  variant  long- 
lived  in  the  lysate.  We  tested  both  predictions. 

Since  the  conventional  N-terminal  sequencing  of  RGS4  pro¬ 
duced  in  the  lysate  would  have  required  a  major  scale  up  of  the 
reaction  protocol,  we  employed  radiochemical  sequencing.  A 
plasmid  encoding  RGS4mi9A  fused  to  glutathione  transferase 
(GST)  was  constructed.  The  RGS4mi9a‘CIST  protein  was  ex¬ 
pressed  in  the  lysate  and  was  found  to  be  degraded  by  the 
N-end  rule  pathway  indistinguishably  from  the  unmodified 
RGS4  (data  not  shown).  The  M19A  mutation  was  introduced  to 
preclude  the  formation  of  N-terminally  truncated  RGS4  vari¬ 
ant  (Fig.  3,  A  and  D),  RGS4|^i9^-GST  was  synthesized  in  the 
lysate  in  the  presence  of  Arg-^-Ala  (type  1  inhibitor  of  the 
N-end  rule  pathway)  and  either  [^H]  arginine,  pH] lysine,  or 
[^H]  leucine.  The  resulting  ^H-labeled  RGS4mi9a-GST  proteins 
were  purified  using  glutathione-Sepharose  affinity  chromatog- 
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raphy  and  SDS-PAGE,  The  band  of  was  sub¬ 

jected  to  automated  Edman  degradation,  and  released  in 
each  cycle  was  determined. 

RGS4mi9a’GST  labeled  with  pHlarginine  yielded  a  major 
peak  of  in  the  first  Edman  cycle,  followed  by  a  second  peak 
in  the  cycle  14,  consistent  with  Arg  at  positions  1  and  14  (Fig. 
6A).  Whereas  Arg  at  position  1  would  have  to  be  conjugated  to 
RGS4  posttranslationally,  as  predicted  by  the  above  model,  the 
first  encoded  Arg  residue  of  RGS4  was,  in  fact,  located  at  the 
encoded  position  14  of  arginylated  RGS4  (in  the  sequence 
frame  that  begins  with  Arg-Cys-Lys-Gly-)  (Fig.  6A),  This  ex¬ 
periment  was  carried  out  twice,  with  two  independent  prepa¬ 
rations  of  PH]RGS4mi9a-GST,  and  gave  the  same  result. 


Fig.  4.  The  relative  levels  of  RGS4  and  engineered  N-end  rule 
substrates  in  the  presence  of  cognate  dipeptide  inhibitors  in 
reticulocyte  lysate.  O,  the  ratios,  denoted  as  dipeptides/controls  (D/ 
C),  of  the  relative  amounts  of  RGS4  in  the  presence  versus  the  absence 
of  1  mM  Arg-^-Ala,  as  a  function  of  time  in  the  lysate.  These  ratios  were 
calculated  from  the  data  in  Fig.  3C.  0 ,  the  analogous  ratio  curve  but  for 
Tyr-nsP4*  in  the  presence  of  1  mM  Trp-Ala  (primary  data  not  shown). 
□,  the  analogous  ratio  curve  but  for  Arg-nsP4*  in  the  presence  of  1  mM 
Arg- /3-Ala  (calculated  from  the  data  in  Fig.  IB;  the  4-h  ratio  could  not  be 
determined  reliably  because  of  too  low  in  the  band  of  Arg-nsP4*  in 
the  absence  of  dipeptides).  Note  the  monotonous  increase  of  the  dipep¬ 
tides/controls  ratio  for  engineered  N-end  rule  substrate  but  not  for 
RGS4, 


RGS4mi9a-GST  labeled  with  [^H]lysine  yielded  virtually  no  at 
position  1  but  produced  a  major  peak  at  position  3  and  eleva¬ 
tions  at  positions  17  and  20,  consistent  with  the  known  posi¬ 
tions  of  three  Lys  residues  in  the  N-terminal  sequence  of  argi¬ 
nylated  RGS4  (Fig.  6B).  RGS4mi9a’GST  labeled  with 
pH] leucine  yielded  no  in  the  cycle  1  but  produced  peaks  in 
the  cycles  5  and  8,  once  again  consistent  with  the  known 
positions  of  two  Leu  residues  in  the  N-terminal  sequence  of 
arginylated  RGS4  (Fig.  6C).  Taken  together,  these  independent 
sets  of  radiochemical  sequencing  data  indicated  that  the  bulk  of 
RGS4  produced  in  reticulocyte  lysate  indeed  bore  the  N-termi¬ 
nal  sequence  Arg-Cys-Lys-Gly-,  with  the  Arg  residue  of  this 
sequence  not  encoded  by  RGS4  mRNA, 

To  verity  the  second  prediction  of  the  model,  we  constructed 
several  mutant  alleles  of  RGS4  and  expressed  them  in  the 
lysate  in  either  the  absence  or  presence  of  N-end  rule  inhibitors 
(Fig.  7).  It  was  found  that  the  replacement  of  Cys-2  with  either 
Gly,  Val,  or  Ala  residues  completely  stabilized  the  resulting 
RGS4c2Gj  RGSc2v^  RGS4c2a  ^  lysate  (Fig.  7),  in  agree¬ 
ment  with  the  modePs  prediction.  We  also  converted  Lys-3  of 
RGS4  into  either  Ser  or  Arg.  Interestingly,  whereas  the  Lys 
Ser  replacement  completely  stabilized  the  resulting  RGS4k3s 
variant,  the  Lys  ^  Arg  replacement  had  no  effect  on  the  deg¬ 
radation  of  RGS4k3r  (Fig,  7).  Thus,  although  Lys-3  is  not 
required  for  ubiquitylation  of  RGS4  (since  it  could  be  replaced 
with  the  non-ubiquitylatable  Arg),  the  presence  of  a  basic  res¬ 
idue  (either  Lys  or  Arg)  at  position  3  (position  2  after  the 
removal  of  N-terminal  Met)  is  required  for  the  RGS4  degrada¬ 
tion  by  the  N-end  rule  pathway. 

The  fact  that  RGS4k3s>  which  bears  N-terminal  Cys  and 
differs  from  RGS4  exclusively  at  position  3  (Lys  ^  Ser  replace¬ 
ment),  was  stable  in  the  lysate  (Fig.  7)  indicated  that  the 
presence  of  N-terminal  Cys  was  not  sufficient  for  making  a 
protein  an  N-end  rule  substrate.  To  explore  this  issue  with 
other  natural  proteins,  we  expressed  in  the  lysate  the  mouse 
protein  G^5l,  a  member  of  the  family  of  /3  subunits  of  hetero- 
trimeric  G  proteins  (66).  The  predicted  N-terminal  sequence  of 
G/35L  is  (Met)-Cys-Asp-Gln-Thr-.  We  found  that  Arg-/3-Ala,  the 
type  1  inhibitor  of  the  N-end  rule  pathway,  had  no  effect  on  the 


Fig.  5.  Proteasome  inhibitor  MGl  15 
decreases  degradation  of  RGS4  and 
Arg-nsP4*  in  reticulocyte  lysate. 
RGS4  (A)  and  Arg-nsP4*  (C)  were  ex¬ 
pressed  in  the  transcription -translation 
reticuloc)rte  lysate  in  the  presence  of 
[^®S]  methionine  and  either  in  the  absence 
of  MGl  15  (Me2SO  solvent  alone;  lanes  c) 
or  in  the  presence  of  10  fiM  (lane  1)  and 
100  ^M  (lane  2)  MG115.  The  reactions 
were  carried  out  for  10,  20,  30,  60,  120, 
and  240  min,  followed  by  SDS-PAGE,  au¬ 
toradiography,  and  quantitation.  The  po¬ 
sitions  of  RGS4  and  Arg-nsP4*  bands  and 
of  the  ladder  of  multi-Ub-containing  spe¬ 
cies  are  indicated  on  the  left,  B,  relative 
amounts  of  ^®S-RGS4  in  A  were  deter¬ 
mined  using  Phosphoriraager  and  were 
plotted  as  a  function  of  incubation  time. 
The  amount  of  RGS4  in  the  reaction  with¬ 
out  MGl  15  at  30  min  was  assigned  the 
value  of  100.  □,  control  reaction,  in  the 
absence  of  MG115;  0 ,  in  the  presence  of 
10  fM  MG115;  A,  in  the  presence  of  100 
fjM  MG115.  £),  the  same  as  in  B  but  for 
Arg-nsP4*. 
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Fig.  6.  Radiochemical  N-terminal  sequencing  of  RGS4  pro¬ 
duced  in  the  reticulocyte  lysate.  RGS4i^i9^-GST  was  synthesized  in 
the  transcription-translation  reticulocyte  lysate  for  30  min  in  the  pres¬ 
ence  of  1  mM  Arg-j3— Ala  and  one  of  the  ^H-containing  amino  acids: 
pH]  arginine  (A),  [%] lysine  (B),  or  [^H] leucine  (C).  The  resulting 
labeled  RGS4mi9a“GST  was  purified  by  affinity  chromatography  and 
SDS-PAGE,  followed  by  radiochemical  N-terminal  sequencing  through 
Edman  degradation,  as  described  under  “Experimental  Procedures.” 
Plotted  are  the  amounts  of  (cpm)  recovered  in  each  Edman  cycle.  The 
inferred  N-terminal  sequence  of  RGS4mi9a-GST  (see  “Results”)  is 
shown  at  the  top  of  each  panel,  with  the  expected  residues  high¬ 
lighted  in  bold. 


pattern  of  accumulation  of  GjSgL  in  reticulocyte  lysate  (data  not 
shown). 

The  encoded  N-terminal  sequence  of  RGS4  is  similar  to  those 
of  RGS5  and  RGS16;  the  similarities  include  Cys-2  and  a  basic 
residue  at  position  3  (55,  67).  To  test  whether  RGS16  was  an 
N-end  rule  substrate,  we  expressed  RGS16  in  reticulocyte  ly¬ 
sate.  As  sho-wn  in  Fig.  8A,  the  relative  amount  of  RGS16  was 
indeed  significantly  higher  in  the  presence  of  Arg-/3-Ala  (type  1 
inhibitor)  than  either  in  the  presence  of  Trp-Ala  (type  2  inhib¬ 
itor)  or  in  the  absence  of  dipeptides.  The  temporal  pattern  of 
the  inhibitor-produced  increase  in  the  RGS16  concentration 
was  similar  to  that  described  for  RGS4  in  Fig.  3B  (note  the  30- 
and  60-min  points  in  Fig.  8A).  The  effect  of  Arg-)3-Ala  on  the 
accumulation  of  RGS16  (Fig.  8A),  although  significant,  was 
smaller  than  its  effect  on  the  accumulation  of  RGS4  (Fig.  3A). 
Longer  autoradiographic  exposures  showed  the  ladders  of 
multi-Ub  chains,  apparently  conjugated  to  RGS16  (Fig.  SB). 
Consistent  with  the  weaker  effect  of  Arg-/3-Ala,  the  relative 
level  of  RGS  16-specific  multi-Ub  chains  (in  the  absence  of  in¬ 
hibitors)  was  lower  than  that  of  RGS4-specific  multi-Ub  chains 
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Fig.  7.  Single  residue  substitutions  at  the  N  terminus  of  RGS4 
abolish  its  degradation  in  reticulocyte  lysate.  Wild  t3rpe  mouse 
RGS4  and  the  indicated  single  residue  derivatives  of  RGS4p  were 
expressed  in  the  transcription-translation  reticulocyte  lysate  in  the 
presence  of  F®S]  methionine  and  either  in  the  absence  of  N-end  rule 
inhibitors  (lane  c),  or  in  the  presence  of  1  mM  Arg-^-Ala  (lane  2),  or  in 
the  presence  of  1  mM  Trp-Ala  (lane  2).  Dipeptide -containing  samples 
also  contained  0.15  mM  bestatin.  The  reactions  were  carried  out  for  15, 
30,  60,  and  120  min,  followed  by  SDS-PAGE  and  autoradiography. 


(Figs.  3A  and  SB  and  data  not  shown).  Finally,  the  ubiquityla- 
tion  of  RGS  16  was  significantly  delayed  in  the  presence  of 
Arg-j3-Ala  (type  1  inhibitor)  but  not  in  the  presence  of  Trp-Ala 
(type  2  inhibitor)  (Fig.  8B).  Taken  together,  these  data  (Fig.  8) 
indicated  that  RGS  16  was  also  a  substrate  of  the  N-end  rule 
pathway  in  reticulocjrte  lysate. 

RGS4  Is  Metabolically  Unstable  in  Vivo — A  mouse  RGS4- 
expressing  plasmid  was  transiently  transfected  into  mouse  L 
cells,  and  the  metabolic  stability  of  RGS4  was  measured  in  a 
pulse-chase  assay,  using  an  antibody  to  RGS4.  The  transiently 
expressed  RGS4  was  found  to  be  short-lived  in  L  cells,  with  the 
half-life  of  40-50  min  (Fig.  9,  A  and  B).  When  L  cells  were 
treated  with  the  proteasome  inhibitor  MG132,  both  before  and 
during  pulse-chase,  RGS4  was  significantly  stabiHzed  (Fig.  9,  A 
and  B).  Thus,  similarly  to  the  results  with  reticulocyte  lysate, 
the  in  vivo  degradation  of  RGS4  was  proteasome-dependent. 

To  verify  whether  the  degradation  of  RGS4  in  vivo  required 
its  Cys-based  N-degron,  L  cells  were  transfected  with  the  plas- 
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mids  expressing  the  RGS4c2g  RGS4c2v  mutants,  and  the 
in  vivo  decay  curves  of  these  proteins  were  compared  with  that 
of  wild  type  RGS4.  It  was  found  that  these  variants  of  RGS4, 
which  were  completely  stable  in  reticulocyte  lysate  (Fig.  7), 
were  degraded  in  L  cells  similarly  to  wild  type  RGS4  (data  not 
shown). 

DISCUSSION 

We  searched  for  substrates  of  the  mammalian  N-end  rule 
pathway  by  testing  some  of  the  previously  known  candidates 
and  also  by  emplo3dng  a  modification  of  the  sib  selection-based 
in  vitro  screen  (42,  43)  in  a  transcription-translation  reticulo¬ 
cyte  lysate.  Two  N-end  rule  substrates  thus  identified  were 
mouse  RGS4  and  RGS16.  These  proteins  are  members  of  the 
RGS  family  of  GTPase-activating  proteins  (GAPs)  that  down- 
regulate  specific  G  proteins  (44-50,  67).  We  report  the  follow¬ 
ing  results. 

1)  The  in  vitro  activated,  proteolytically  processed  larger  (80 
kDa)  subunit  of  the  cysteine  protease  m-calpain  was  previously 
shown  to  bear  N-terminal  Lys  (59),  a  type  1-destabilizing  res¬ 
idue  in  the  N-end  rule  (10).  We  expressed  this  subunit,  termed 
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Fig.  8.  Type  1  N-end  rule  inhibitor  decreases  degradation  and 
ubiquitylation  of  RGS16  in  reticulocyte  lysate.  A,  RGS16  was 
expressed  in  the  transcription-translation  reticiiloc3rte  lysate  in  the 
presence  of  pH]leucine  and  either  in  the  absence  of  N-end  rule  inhibi¬ 
tors  (lane  c)  or  in  the  presence  of  1  mM  Arg-^-Ala  (lane  1)  or  1  mM 
Trp-Ala  (lane  2).  Dipeptide-containing  samples  also  contained  0.15  mM 
bestatin.  The  reactions  were  carried  out  for  10,  20,  30,  60, 120,  and  240 
min,  followed  by  SDS-PAGE  and  fluorography.  B,  the  same  as  m  A,  but 
with  autoradiographic  overexposure  to  visualize  the  ladders  of  multi- 
Ub-containing  species. 


Lys-mCL,  in  the  reticulocyte  lysate  as  a  Ub-Lys-mCL  fusion. 
The  resulting  Lys-mCL  was  not  a  substrate  of  the  N-end  rule 
pathway,  despite  the  presence  of  a  destabilizing  N-terminal 
residue.  Since  an  N-degron  is  a  bipartite  signal  (12, 13, 15),  the 
absence  of  active  N-degron  from  Lys-mCL  could  be  due  to  the 
absence  of  a  targetable  internal  Lys  residue,  the  second  deter¬ 
minant  of  an  N-degron.  Another  possibility  is  a  steric  hin¬ 
drance  in  the  binding  of  UBRl  to  the  N-terminal  lysine  of 
Lys-mCL.  Thus,  a  destabilizing  N-terminal  residue  is  not  the 
sole  essential  determinant  of  an  active  N-degron,  as  demon¬ 
strated  previously  with  engineered  N-end  rule  substrates  (12). 

2)  A  major  fi"action  (75-85%)  of  mouse  RGS4  synthesized  in 
reticulocyte  lysate  was  rapidly  degraded  by  a  pathway  that  was 
proteasome-dependent  and  apparently  also  Ub-dependent.  The 
other  15-25%  of  the  newly  made  RGS4  was  found  to  be  stable 
in  the  lysate. 

3)  The  degradation  of  RGS4  was  inhibited  by  dipeptides 
bearing  type  1-destabilizing  N-terminal  residues  (Arg-^-Ala  or 
Lys-Ala)  but  was  unaffected  by  dipeptides  bearing  a  type  2-de- 
stabilizing  N-terminal  residue  (Trp-Ala)  or  a  type  3  residue 
(Ala-Lys  and  Ala-Arg). 

4)  Radiochemical  sequencing  of  RGS4  produced  in  reticulo¬ 
cyte  lysate  and  labeled  with  either  pH] arginine,  pH] lysine,  or 
pH] leucine  revealed  the  presence  of  posttranslationally  conju¬ 
gated  Arg  at  the  N  terminus  of  RGS4.  The  deduced  N-terminal 
sequence  of  RGS4  was  Met-Cys-Lys-Gly-.  The  observed/in¬ 
ferred  sequence  was  Arg-Cys-Lys-Gly-.  This  result  and,  inde¬ 
pendently,  the  fact  that  degradation  of  RGS4  was  inhibited  by 
t5q)e  1  but  not  by  type  2  dipeptides  strongly  suggested  the 
following  model:  the  N-terminal  Met  of  newly  formed  RGS4  is 
removed  by  MetAPs;  the  resulting  N-terminal  Cys  is  arginy- 
lated  by  R-transferase;  and  the  arginylated  RGS4  is  targeted 
for  processive  degradation  by  the  UBRl -encoded  E3a  and  the 
rest  of  the  N-end  rule  pathway. 

5)  In  agreement  with  a  prediction  of  this  model,  the  degra¬ 
dation  of  RGS4  in  reticulocyte  lysate  was  found  to  require 
Cys-2  residue,  which  becomes  N-terminal  after  the  MetAP- 
mediated  cotranslational  removal  of  Met.  Specifically,  the 
RGS4  mutants  C2G,  C2V,  and  C2A,  in  which  Cys-2  was  re¬ 
placed  with  Gly,  Val  or  Ala,  were  completely  stable  in  the 
lysate.  The  former  two  residues  are  stabilizing  in  the  N-end 
rule,  and  neither  of  the  3  residues  is  expected  to  interfere  with 
the  removal  of  N-terminal  Met  by  MetAPs  (64). 

6)  A  Lys  — >  Ser  replacement  at  the  encoded  position  3  of 
RGS4  also  completely  stabilized  the  resulting  RGS4k3s  pro¬ 
tein.  However,  a  Lys  Arg  replacement  at  this  position  had  no 
effect  on  the  degradation  of  RGS4k3r.  Thus,  Lys-3  is  not  re¬ 
quired  for  ubiquitylation  of  RGS4.  However,  the  presence  of  a 
basic  residue  (either  Lys  or  Arg)  immediately  after  Cys  is 


Fig.  9.  In  vivo  degradation  of  RGS4 
in  transiently  transfected  mouse  L 
cells.  A,  L  cells  were  transfected  with  an 
RGS4-expressing  plasmid  (see  “Experi¬ 
mental  Procedures”).  Cells  were  preincu¬ 
bated  in  methionine-free  medium  for  45 
min  and  then  labeled  for  12  min  with 
methionine/cysteine  and  chased  for 
0,  1,  or  2  h.  Either  50  yM  MG132  (from  a 
stodk  solution  in  MegSO)  or  the  equiva¬ 
lent  amount  of  Me2SO  were  present  dur¬ 
ing  both  preincubation  in  methionine-free 
medium  and  the  pulse-chase.  Cell  ex¬ 
tracts  were  immunoprecipitated  with  an- 
ti-RGS4  antibody,  followed  by  SDS-PAGE 
and  autoradiography.  B,  the  pattern  in  A 
was  quantitated  using  Phosphorlmager. 
The  amount  of  in  RGS4  at  time  0  (end 
of  the  pulse)  was  taken  as  100%.  □,  no 
proteasome  inhibitor;  circo,  in  the  pres¬ 
ence  of  50  yM  MG132. 
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essential  for  the  RGS4  degradation  by  the  N-end  rule  pathway. 
One  possibility  is  that  the  requirement  for  a  basic  residue  at 
this  position  reflects  the  substrate  specificity  of  an  uncharac¬ 
terized  R-transferase  that  arginylates  the  N-terminal  Cys. 

7)  The  mouse  protein  GjBgL,  a  member  of  the  family  of  ]8 
subunits  of  heterotrimeric  G  proteins  (66)  that  bore  the  (initial) 
N-terminal  Met-Cys,  was  tested  and  found  not  to  be  an  N-end 
rule  substrate,  similarly  to  the  RGS4k3s  mutant,  which  also 
bore  the  (initial)  N-terminal  sequence  Met-Cys  but  was  not 
degraded  by  the  N-end  rule  pathway.  Thus,  the  N-terminal  Cys 
of  wild  type  RGS4  is  an  essential  determinant  of  its  N-degron, 
but  other  N  terminus-proximal  residues  are  relevant  as  well. 

8)  Similarities  among  the  N-terminal  sequences  of  RGS4, 
RGS5,  and  RGS16  (67)  suggested  that  RGS5  and  RGS16  may 
also  be  N-end  rule  substrates.  This  prediction  was  tested,  thus 
far,  with  RGS16,  and  was  confirmed. 

9)  RGS4  was  transiently  expressed  in  mouse  L  cells  and 
found  to  be  an  unstable  protein,  with  the  half-life  of 40-50  min. 
The  in  vivo  degradation  of  RGS4  was  proteasome-dependent, 
as  indicated  by  the  nearly  complete  stabilization  of  RGS4  in  the 
presence  of  proteasome  inhibitor  MG132.  However,  the  target¬ 
ing  of  RGS4  in  this  in  vivo  setting  involved  primarily  a  degron 
distinct  from  the  Cys-based  N-degron,  because  N-terminal 
variants  of  RGS4  such  as  RGS4c2v  and  RGS4c2G»  which  were 
completely  stable  in  reticulocyte  lysate,  remained  unstable  in  L 
cells. 

We  consider  the  latter  result  first.  15-25%  of  the  reticulocyte 
lysate-produced  RGS4  was  resistant  to  degradation  by  the  N- 
end  rule  pathway  (paragraph  2  above).  An  analogous  protec¬ 
tion  of  RGS4  against  targeting  by  the  N-end  rule  pathway  in  L 
cells  might  involve  a  much  higher  fraction  of  the  newly  formed 
RGS4.  The  mechanism  of  protection  may  be  a  modification  of 
the  N-terminal  Cys,  for  example,  its  palmitoylation  (45,  47, 62, 
67)  or  acetylation  (68)  that  would  be  expected  to  preclude  the 
arginylation  and  degradation  of  RGS4  by  the  N-end  rule  path¬ 
way.  Thus  it  is  possible,  indeed  likely,  that  there  is  a  kinetic 
competition  among  these  reactions  at  the  N-terminal  Cys  of  a 
nascent  RGS4  protein.  In  experiments  to  address  this  model, 
we  added  varying  amounts  of  acetyl-CoA  or  palmitoyl-CoA  (the 
substrates  of  N-terminal  acetylases  and  palmitoyltransferases) 
to  reticulocyte  lysate;  no  effect  of  the  added  compounds  on  the 
degradation  of  newly  formed  RGS4  in  the  lysate  was  observed.® 
A  second  possibility  is  a  spatial  localization  of  RGS4  in  L  cells 
that  protects  it  from  degradation  by  the  N-end  rule  pathway 
but  leaves  RGS4  still  targetable  by  another  proteasome-de¬ 
pendent  pathway(s)  that  is  inactive  in  reticuloc3rtes.  Yet  an¬ 
other  possible  reason  for  the  difference  between  the  results 
with  reticulocyte  lysate  versus  L  cells  is  that  the  cysteine 
branch  of  the  N-end  rule  pathway  may  be  cell  type-specific;  for 
example,  a  Cys-specific  R-transferase  is  present  in  reticulo¬ 
cytes  but  might  be  expressed  at  a  lower  level  in  L  cells.  In 
recent  experiments,  RGS4  was  expressed  in  Xenopus  laevis 
oocytes,  through  microinjection  of  RGS4  mRNA.  It  was  found 
that,  similarly  to  the  results  with  reticulocyte  extracts,  RGS4 
was  degraded  in  oocytes  by  the  cysteine  branch  of  the  N-end 
rule  pathway.® 

One  difficulty  in  identifying  physiological  N-end  rule  sub¬ 
strates  (as  distinguished  from  those  produced  through  the  Ub 
fusion  technique  (10))  is  caused  by  the  fact  that  MetAPs  remove 
N-terminal  Met  from  newly  formed  proteins  if,  and  only  if,  the 
second  residue  in  a  polypeptide  chain  is  stabilizing  in  the 
yeast-type  N-end  rule.  Specifically,  the  known  MetAPs  remove 
N-terminal  Met  if  the  second  position  residues  are  Gly,  Val, 


®  I.  Davydov  and  A  Varshavsky,  unpublished  data. 

®  J.  Sheng,  I.  Davydov,  and  A.  Varshavsky,  unpublished  data. 


Ala,  Ser,  Thr,  Cys,  or  Pro  (63—65).  All  of  these  residues  are 
stabilizing  in  the  yeast-type  N-end  rule  (10).  Therefore,  a  nat¬ 
ural  substrate  of  an  N-end  end  rule  pathway  that  bears  an 
N-degron  in,  for  instance,  S.  cerevisiae  cells  can  be  produced 
exclusively  through  cleavages  (anywhere  along  the  polypeptide 
chain)  by  proteases  distinct  from  MetAPs.  An  example  of  N-end 
rule  substrate  of  this  class  is  the  S.  cerevisiae  protein  SCCl,  a 
subunit  of  the  cohesin  complex  that  holds  together  sister  chro¬ 
matids.  In  a  reaction  that  requires  ESPl,  the  566-residue  SCCl 
is  cleaved  (at  the  time  of  sister  chromatid  separation)  at  posi¬ 
tions  180  and/or  268,  resulting  in  fragments  that  bear  N-ter¬ 
minal  Arg,  a  type  l-destabilizing  residue  (69).  These  fragments 
were  fmmd  to  be  degraded  by  the  N-end  rule  pathway  in  vivoJ^ 

The  situation  in  metazoans  such  as  mammals  is  similar  to 
the  one  in  yeast,  in  that  the  yeast-type  destabilizing  residues 
cannot  be  exposed  at  the  N  termini  of  mammalian  proteins 
through  cleavages  by  the  known  MetAPs  (10,  64).  The  differ¬ 
ence  here  is  that  Cys,  Ala,  Ser,  Thr,  and  Pro  are  stabilizing 
residues  in  yeast  but  destabilizing  in  mammalian  cells  (10,  25, 
70).  In  contrast  to  the  other  destabilizing  residues,  the  second 
position  Cys,  Ala,  Ser,  Thr,  and  Pro  can  be  efficiently  exposed 
at  the  N  termini  of  newly  formed  proteins  through  the  action  of 
MetAPs.  Of  these  five  residues,  N-terminal  Cys  is  a  special 
case;  its  destabilizing  activity  requires  its  posttranslational 
conjugation  to  Arg,  a  primary  destabilizing  residue  recognized 
by  the  type  1  site  of  UBRl  (E3a)  (10). 

The  finding  of  the  dependence  of  destabilizing  activity  of 
N-terminal  Cys  on  the  presence  of  tRNA  (25,  26),  and  the 
identification,  in  the  present  work,  of  Arg  as  the  posttransla- 
tionally  linked  N-terminal  residue  of  mouse  RGS4,  indicated 
the  existence  of  an  Arg-tRNA-protein  transferase  (R-transfer¬ 
ase)  that  conjugates  Arg  to  N-terminal  Cys.  This  R-transferase 
is  distinct  from  the  two  previously  characterized  mammalian 
R- transferases,  both  of  which  are  encoded  by  the  alternatively 
spliced  ATEl  gene  (27).  The  two  forms  of  mouse  ATEl  R- 
transferase  have  different  specific  activities  but  apparently  the 
same  substrate  specificity;  similarly  to  their  S,  cerevisiae  hom¬ 
olog,  they  arginylate  N-terminal  Asp  and  Glu  but  cannot  argi- 
nylate  N-terminal  Cys  (27). 

The  —20  known  proteins  of  the  mammalian  RGS  family  have 
in  common  the  ~  130-residue  RGS  domain  that  binds  to  Ga 
subunits  and  is  responsible  for  the  GAP  function  of  RGS  pro¬ 
teins  (44,  45).  As  would  be  expected  of  pleiotropic  regulators  at 
key  junctions  of  the  cellular  metabolism,  the  activity  of  RGS 
proteins  is  controlled  at  several  levels,  including  regulation  of 
their  s3Tithesis  and  localization  (44).  It  is  becoming  increas¬ 
ingly  clear  that  the  (conditional)  degradation  of  RGSs  is  yet 
another  way  in  which  the  activity  of  these  proteins  is  modu¬ 
lated  in  cells.  Besides  RGS4  and  RGS  16  of  the  present  work, 
one  other  member  of  the  family,  RGS7,  was  recently  shown  to 
be  unstable  in  vivo  (71,  72).  The  normally  short-lived  RGS7  is 
stabilized  through  its  interaction  with  polycystin,  a  PKD1~ 
encoded  protein  involved  in  polycystic  kidney  disease  (71).  The 
degradation  of  RGS7  is  also  decreased  upon  exposure  of  cells  to 
tumor  necrosis  factor  a;  the  resulting  increase  in  the  concen¬ 
tration  of  RGS7  may  contribute  to  sepsis-induced  changes  in 
the  nervous  system  (72).  The  degradation  signal(s)  of  RGS7 
remains  to  be  characterized. 

Our  major  results  are  the  findings  that  RGS4  and  RGS16 
(and  possibly  also  RGS5)  are  substrates  of  the  cysteine  branch 
of  the  N-end  rule  pathway  and  that  the  Cys-based  N-degron  of 
these  proteins  functions  through  the  posttranslational  arginy- 
lation  of  N-terminal  Cys  by  an  unidentified,  apparently  Cys- 
specific  R-transferase.  RGS4  and  RGS16  are  the  first  physio- 


^  H.  Rao  and  A.  Varshavsky,  unpublished  data. 
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logical  substrates  of  the  N-end  rule  pathway  that  bear  a 
secondary  destabilizing  N-terminal  residue,  which  functions 
through  its  conjugation  to  Arg,  a  primaiy  destabilizing  residue. 

Specific  functions  of  the  metabolic  instability  of  RGS4  and 
RGS16  remain  to  be  understood.  One  route  to  these  functions  is 
through  the  cloning  of  Cys-speciflc  R-transferase  and  construc¬ 
tion  of  mouse  strains  that  lack  this  enzyme  and  (therefore)  the 
cysteine  branch  of  the  N-end  rule  pathway.  It  would  also  be 
important  to  define,  in  detail,  Cys-proximal  N-terminal  se¬ 
quences  in  proteins  that  promote  the  arginylation  of  N-termi- 
nal  Cys.  This  information  will  facilitate  the  identification  of 
Cys-specific  N-end  rule  substrates  in  data  bases  of  protein 
sequences. 
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The  ubiquitin  (Ub)  fusion  technique  was  developed  in  1985-1986, 
through  experiments  in  which  a  segment  of  DNA  encoding  the  76-residue 
Ub  was  joined,  in  frame,  to  DNA  encoding  Escherichia  coli  j8-galactosidase 
(/Sgal).'-^  When  the  resulting  protein  fusion  was  expressed  in  the  yeast 
Saccharomyces  cerevisiae  and  detected  by  radiolabeling  and  immunoprecip- 
itation  with  an  anti-/3gal  antibody,  only  the  moiety  of  )8gal  was  observed, 
even  if  the  labeling  time  was  short  enough  to  be  comparable  to  the  time 
(1-2  min)  required  for  translation  of  the  Ub-/3gal  open  reading  frame 
(ORF).  It  was  found  that  in  eukaryotic  cells  the  Ub  moiety  of  the  fusion 
was  rapidly  cleaved  off  after  the  last  residue  of  Ub  (Fig.  1).'  The  proteases 
involved  are  called  deubiquitylating^  enzymes  (DUBs)  or  LJb-specific  pro¬ 
cessing  proteases  (UBPs).**-’  A  eukaryotic  cell  contains  more  than  10  dis¬ 
tinct  DUBs,  all  of  which  are  highly  specific  for  the  Ub  moiety.  The  in  vivo 


'  A.  Bacliniair,  D.  Finley,  and  A.  Varshavsky,  Science  234,  179  (1986). 

^  A.  Varshavsky,  Proc.  Nall.  Acad.  Sci.  U.S.A.  93,  12142  (1996). 

*  Ubiquitin  whose  C-terminal  (Gly-76)  carboxyl  group  is  covalently  linked  to  another  com¬ 
pound  is  called  the  ubiquilyl  moiety,  the  derivative  terms  being  ubiquitylation  and  ubiqiii- 
tylated.  The  term  Ub  refers  to  both  free  ubiquitin  and  the  ubiquityl  moiety.  This  nomencla¬ 
ture,  which  is  also  recommended  by  the  Nomenclature  Committee  of  the  International 
Union  of  Biochemistry  and  Molecular  Biology,''*  brings  Ub-related  terms  in  line  with  the 
standard  chemical  terminology. 

*  K.  Wilkinson  and  M.  Hochstrasser,  in  “Ubiquitin  and  the  Biology  of  the  Cell”  (J.-M.  Peters, 
J.  R.  Harris,  and  D.  Finley,  eds  ).  Plenum  Press,  New  York,  1998. 

*J.  W.  Tobias  and  A.  Varshavsky,  /  Biol.  Chein.  266,  12021  (1991). 

*  R.  T.  Baker,  J.  W.  Tobias,  and  A.  Varshavsky,  /  Biol  Chem.  267,  23364  (1992). 

’  C.  A.  Gilchrist,  D.  A.  Gray,  and  R.  T.  Baker,  J.  Biol.  Chem.  272,  32280  (1997). 
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Deubiquitylating  enzyme 
(DUB) 


X  prtifeiri  y 

Fig.  1.  The  ubiquitin  fusion  technique.  Linear  fusions  of  Ub  to  other  proteins  are  cleaved 
after  the  last  residue  of  Ub  by  deubiquitylating  enzymes  (DUBs)  (see  text).'*^ 


cleavage  at  the  Ub-polypeptide  junction  of  a  Ub  fusion  has  been  shown 
to  be  largely  cotranslational.®’^ 

One  physiological  function  of  the  cleavage  reaction  (Fig.  1)  is  to  mediate 
the  excision  of  Ub  from  its  natural  DNA-encoded  fusions  either  to  itself 
(poly-Ub)‘®  or  to  specific  ribosomal  proteins.**’*^  Many  of  the  DUB  prote¬ 
ases  that  catalyze  the  cleavage  of  linear  Ub  fusions  can  also  cleave  Ub  off 
its  branched,  posttranslationally  formed  conjugates,  in  which  Ub  is  joined 
either  to  itself,  as  in  a  multi-Ub  chain,  or  to  other  proteins.'^  A  branched 
Ub-protein  conjugate  usually  comprises  a  multi-Ub  chain  covalently  linked 
to  an  internal  lysine  residue  of  a  substrate  protein.  The  ubiquitylated  sub¬ 
strate  is  processively  degraded  by  the  26S  proleasome,  an  ATP-dependenl 
multisubunit  protease.'^-’’  For  reviews  of  the  Ub  system,  see  Refs.  18-22. 

Another  finding  about  the  DUB-mediated  cleavage  reaction  (Fig.  1) 

*  N.  Johnsson  and  A.  Varshavsky,  EMBO  /  13,  2686  (199-4). 

C  Turner  and  A.  Varshavsky,  submitled  (2()()()), 

“'D.  Finley,  E.  Ozkaynak,  and  A.  Varshavsky,  Cell  48,  1035  (1987). 

"  K.  L.  Redman  and  M.  Rechsteiner,  Nature  {London)  338,  438  (1989). 

D.  Finley,  B.  Bartel,  and  A.  Varshavsky,  Nature  {London)  338,  394  (1989). 

■'C  M.  Pickarl,  FASEB  J.  11,  1055  (1997). 

O.  C  oux,  k.  lanaka,  and  A.  L.  Goldberg,  An/ju.  Rev.  Biocheni.  65,  801  (1996). 

W.  Baumeislcr.  J.  Walz,  F.  Zllhl,  and  E.  Seemailer,  Cell  92,  367  (1998). 

M.  Rechsteiner,  in  “Ubiquitin  and  the  Biology  of  the  Cell”  (J.  M.  Peters,  J.  R.  Harris,  and 

D,  Finley,  eds.),  pp.  147-189.  Plenum  Press,  New  York,  1998. 

G.  N.  DeMartino  and  C.  A.  Slaughter,  J.  Biol.  Cheni.  274.  22123  (1999). 

‘^M.  Hochslrasser,  Annu.  Rev.  Genet.  30,  405  (1996). 
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Fig.  2.  The  N-end  rule  of  the  yeast  S.  cercvisiae,'^  Specific  residues  at  the  N  terminus  of 
a  test  protein  such  as  )3gal  are  produced  by  the  Ub  fusion  technique  (Fig.  1  and  text).  The 
in  vivo  half-lives  of  the  corresponding  proteins  are  indicated  on  the  right.  Stabilizing 

N-lerniinal  residues  (Mel,  Gly,  Ala,  Ser,  I'hr,  Cys,  Val,  and  iho)  arc  not  iccognized  by  Ubrlp 
(N-recognin),  the  E3  component  of  the  N-end  rule  pathway.  Primary  destabilizing  N-terminal 
residues  (Arg,  Lys,  His,  Phe,  Trp,  Leu,  Tyr,  and  lie)  are  directly  bound  by  either  type  1  or 
type  2  substrate-binding  sites  of  Ubrlp.  Secondary  destabilizing  N-terminal  residues  are 
arginylaled  by  the  AT  El -encoded  Arg-tRN  A-prolein  transferase  (K-transferase),  yielding 
the  N-terminal  Arg,  a  primary  destabilizing  residue.  Tertiary  destabilizing  N-terminal  residues 
Asn  and  Gin  are  deamidated  by  the  NTAl -encoded  N-terminal  amidohydrolase  (Nl-amidase), 
yielding  the  secondary  destabilizing  residues  Asp  and  Glu,  respectively.  The  N-end  rule  of 
mammalian  cells  is  similar  but  contains  fewer  stabilizing  residues.^ 


led  to  the  discovery  of  the  N-end  rule,  a  relation  between  the  ui  vivo  half- 
life  of  a  protein  and  the  identity  of  its  N-terminal  residue  (Fig.  2).'  First, 
it  was  shown  that  the  cleavage  of  a  Ub-A'-polypeptide  fusion  after  the 
last  residue  of  Ub  takes  place  regardless  of  the  identity  of  a  residue  X  at 
the  C-terminal  side  of  the  cleavage  site,  proline  being  the  single  exception. 
By  allowing  a  bypass  of  the  “normal”  N-terminal  processing  of  a  newly 
formed  protein,  this  result  yielded  an  in  vivo  method  for  placing  different 
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residues  at  the  N  termini  of  otherwise  identical  proteins.  Second,  it  was 
found  that  the  in  vivo  half-lives  of  the  resulting  test  proteins  were  deter¬ 
mined  by  the  identities  of  their  N-terminal  residues,  a  relation  referred  to 
as  the  N-end  rule  (Fig.  2).*  The  N-end  rule  pathway,  which  targets  the  N 
terminus-specific  degradation  signals,  called  the  N-degrons,  is  one  pathway 
of  the  Ub  system.  For  a  review  and  work  on  the  N-end  rule  pathway,  see 
Refs.  2  and  23-31. 

The  Ub  fusion  technique  (Figs.  1  and  2)  remains  the  method  of  choice 
for  producing,  in  vivo,  the  desired  N-terminal  residue  in  a  protein  of  interest. 
Owing  to  the  constraints  of  the  genetic  code,  nascent  proteins  bear  N- 
terminal  methionine  (formyl-Met  in  prokaryotes).  The  known  methionine 
aminopeptidases  (MAPs),  which  remove  N-terminal  Met,  do  so  only  if  the 
residue  to  be  exposed  is  stabilizing  according  to  the  yeast-type  N-end 
rule,^’^^  In  other  words,  MAPs  do  not  cleave  off  N-terminal  methionine  if 
it  is  followed  by  any  of  the  12  destabilizing  residues  (Fig.  2).  The  Ub- 
specific  DUB  proteases  are  free  of  this  constraint,  except  when  the  residue 
A"  of  a  Ub-Af-polypeptide  is  proline,  in  which  case  the  cleavage  still  takes 
place  but  at  a  much  lower  rate.^*^^  More  recently,  a  specific  DUB  was 
identified  that  can  efficiently  cleave  at  the  Ub-proline  junction.^ 

The  Ub  fusions  can  be  deubiquitylated  in  vitro  as  well.^^’^"*  The  high 
activity  and  specificity  of  DUBs  should  make  them  the  reagents  of  choice 
for  applications  that  involve,  for  example,  the  removal  of  affinity  tags  from 
overexpressed  and  purified  proteins.  Unfortunately,  there  are  no  commer¬ 
cially  available  DUBs  at  present,  in  part  because  of  difficulties  encountered 
in  purifying  and  stabilizing  large  DUBs  such  as  S.  cerevisiae  Ubplp,  and  also 
because  Proteinix  (Rockville,  MD),  a  company  that  has  held  the  licenses  for 
Ub  fusion  patents  over  the  last  decade,  has  not  commercialized  this  tech¬ 
nology. 

Another  major  application  of  the  Ub  fusion  technique  resulted  from 
the  observations  that  expression  of  a  protein  as  a  I  lb  fusion  can  dramatically 

C.  Byrd,  G.  C.  Turner,  and  A.  Varshavsky,  EMBO  J.  17,  269  {199H). 

Y.  T.  Kwon,  Y.  Reiss,  V.  A.  Fried,  A.  Hershko,  J.  K.  Yoon.  D.  K.  Gonda,  P.  Sangan,  N.  G. 

Copeland,  N.  A.  Jenkins,  and  A.  Varshavsky,  Proc.  Natl.  Acad.  Sci.  U.S.A.  95,  7898  (1998) 

1.  V.  IJavydov,  1),  Paira,  and  A.  Varshavsky,  Arch.  Bioclicni.  tUophys.  357,  317  (1998). 

Y.  T.  Kwon,  A.  S.  Kashina,  and  A.  Varshavsky,  Mol.  Cell.  Biol.  19,  182  (1999). 

Y.  T.  Kwon,  F.  Levy,  and  A.  Varshavsky,  /  Biol.  Chem.  274,  18135  (1999). 

F.  L^vy,  J.  A.  Johnslon,  and  A.  Varshavsky,  Eur.  J.  Biochem.  259,  244  (1999). 

^''T.  Suzuki  and  A.  Varshavsky,  EMBO  J.  18,  101  (1999). 

Y.  Xie  and  A.  Varshavsky,  Curr.  Genet.  36,  113  (1999). 

P.  O.  Falnes  and  S.  Olsnes,  EMBO  J.  17,  615  (1998). 

R.  A.  Bradshaw,  W.  W.  Brickey,  and  K.  W.  Walker.  Trends  Biochem.  Sci.  23,  263  (1998). 
”  E.  S.  Johnson,  B.  W.  Bartel,  and  A.  Varshavsky,  EMBO  J.  11,  497  (1992). 

D.  K.  Gonda,  A.  Bachinair,  I.  WUnning,  J.  W.  Tobias,  W.  S.  Lane,  and  A.  Varshavsky,  J 

Biol.  Chem.  264,  16700  (1989). 

R.  r.  Baker,  Curr.  Opin.  Biotechnol.  7,  541  (1990). 
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augment  the  yield  of  the  prolein.*^^  4  he  yield  enhancement  effect  of  Ub 
was  observed  with  short  peptides  as  well."*^^'^*  This  and  other  applications 
of  Ub  fusions  are  described  below,  with  references  to  the  original  articles 
and  specific  constructs. 

Production  and  Uses  of  N-Degrons 

An  N-degron  comprises  the  destabilizing  N-terminal  residue  of  a  protein 
and  an  internal  lysine  residue.^'^*^'^^  A  set  of  N-degrons  containing  differ¬ 
ent  N-terminal  residues  that  are  destabilizing  in  a  given  cell  defines  the  N- 
end  rule  of  the  cell.^  The  lysine  determinant  of  an  N-degron  is  the  site  of 
formation  of  a  substrate-linked  multi-Ub  chain. A  way  to  produce  an 
N-degron  in  a  protein  of  interest  is  to  express  the  protein  as  a  Ub  fusion 
in  which  the  junctional  residue  (which  becomes  N-terminal  on  removal  of 
the  Ub  moiety)  is  destabilizing  (Fig.  2).  An  appropriately  positioned  internal 
lysine  residue  (or  residues)  is  the  second  essential  determinant  of  N-degron. 
Many  natural  proteins  lack  such  “targetable”  lysines,  and  therefore  would 
remain  long-lived  even  if  their  N-terminal  residue  were  replaced  by  a 
destabilizing  residue.  One  way  to  bypass  this  difficulty  is  to  link  a  protein 
of  interest  to  a  relatively  short  (<50  residues)  portable  N-degron  that 
contains  both  an  N-terminal  destabilizing  residue  (produced  through  a  Ub 
fusion)  and  a  requisite  lysine  residue(s).  The  earliest  portable  N-degron  of 
this  kind  is  still  among  the  strongest  known  (Fig.  3B).‘  It  was  found, 
using  the  new  strategy  of  a  screen  in  the  sequence  space  of  just  two  amino 
acids,  lysine  and  asparagine,  that  certain  sequences  containing  exclusively 
lysines  and  asparagines  can  function  /;/  vivo  as  highly  effective  N-degrons. 
The  portability  and  modular  organization  of  N-degrons  make  possible  a 
variety  of  applications  whose  common  feature  is  the  conferring  of  a  constitu¬ 
tive  or  conditional  metabolic  instability  on  a  protein  of  interest. 

R.  Butt,  S.  Jonnalagadcia,  B.  P.  Monia,  E.  J.  Sternberg,  J.  A.  Marsh,  J.  M.  Stadcl,  D,  J. 
Ecker,  and  S.  T.  Crooke,  Proc.  Nail.  Acad.  Sci.  U.S.A.  86,  2540  (1989). 

D.  J.  Ecker,  J.  M.  Stadel,  T.  R.  Butt,  J.  A.  Marsh,  B.  P.  Monia,  D.  A.  Powers,  J.  A.  Gorman, 
P.  E.  Clark,  F.  Warren,  and  A.  Shaizinan,  J.  Hiol.  Chem.  264,  7715  (1989). 

-^«P,  Mak,  D.  P.  McDonnell,  N.  L.  Weigel,  W.  T.  Selirader,  and  B.  W.  O’Malley,  J.  lUol. 
Chem.  264,21613  (1989). 

^’R.  T.  Baker,  S.  A.  Smith,  R.  Marano,  J.  McKee,  and  P.  G  Board,  J.  Biol.  Chem.  269, 
25381  (1994). 

Y.  Yoo,  K.  Rote,  and  M.  Rechsteiner,  J.  Biol.  Chem.  264,  17078  (1989). 

A.  Pilon,  P.  Yost,  T.  E.  Chase,  G.  Lohnas,  T.  Burkett,  S.  Roberts,  and  W.  E.  Bentley, 
Biotechnol  Prog.  13,  374  (1997). 

A.  Bachinair  and  A.  Varshavsky,  Cell  56,  1019  (1989). 

C.  P.  Hill,  N.  L.  Johnston,  and  R.  E.  Cohen,  Proc.  Nad.  Acad.  Sci.  U  S  A  90,  4136  (1993) 
V,  Chau,  J.  W.  Tobias,  A.  Baclimair,  D.  Marriott.  D.  J.  Ecker,  D.  K.  Gonda,  and  A. 
Varshavsky,  Science  243,  1576  (1989). 


1411 


UBIOUITIN  FUSION  TECHNIQUE  AND  ITS  DESCENDANTS 


583 


Fig.  3.  The  UPR  (ubiquitin/prolein/reference)  technique.’^  (A)  A  tripartite  fusion  con¬ 
taining  a,  tlie  reference  protein  moiety  wliose  C  terminus  is  linked,  via  a  spacer  peptide,  to 
the  Ub  moiety.  The  C  terminus  of  Ub  is  linked  to  b,  a  protein  of  interest.  In  vivo,  this  tripartite 
fusion  is  cotransiationally  cleaved’  by  deubiquitylating  enzymes  (DUBs)  at  the  Ub-b  junction, 
yielding  equimolar  amounts  of  the  unmodified  protein  b  and  a-Ub,  the  reference  protein  a 
bearing  a  C-terminal  Ub  moiety.  If  a-Ub  is  long-lived,  a  measurement  of  the  ratio  of  a-Ub 
to  b  as  a  function  of  time  or  at  steady  state  yields,  respectively,  the  in  vivo  decay  curve  or 
the  relative  metabolic  stability  of  protein  (B)  Example  of  a  specific  UPR-type  Ub 

fusion.^’  This  fusion  contains  the  following  elements:  DHFRha,  a  mouse  dihydroft)laie  reduc¬ 
tase  (DHFR)  moiety  extended  at  the  C  terminus  by  a  sequence  containing  the  hemagglutinin- 
derived  ha  epitope;  the  Ub  moiety  (more  specifically,  the  Ub*^^^  moiety  bearing  the  Lys  -> 
Arg  alteration  at  position  48);  a  4()-residuc,  E.  coli  Lac  repressor-derived  sequence,  termed 
e*^  (extension  (e)  containing  lysines  (/C)]  and  shown  below  in  single-letter  abbreviations  for 
amino  acids:  a  variable  residue  X  between  Ub  and  e^;  the  E.  coli  /3gal  moiety  lacking  the 
first  24  residues  of  wild-type  /3-Oal.  The  lightning  arrow  indicates  the  siie  of  in  vivo  cleavage 
by  DUBs.-" 


N-Degron  and  Reporter  Proteins 

A  change  in  the  physiological  state  of  a  cell  that  is  preceded  or  followed 
by  the  induction  or  repression  of  specific  genes  can  be  monitored  through 
the  use  of  promoter  fusions  to  a  variety  of  protein  reporters,  such  as,  for 
example,  /3gal,  /3-glucuronidase,  iuciferase,  and  green  lluorescenl  protein 
(GFP).  A  long-lived  reporter  is  useful  for  detecting  the  induction  of  genes. 
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but  is  less  suitable  for  monitoring  either  a  rapid  repression  or  a  temporal 
pattern  that  involves  an  up-  and  downregulation  of  a  gene  of  interest.  A 
sufficiently  short-lived  reporter  is  required  in  such  settings.  The  metaboli- 
cally  unstable  X-^$a\  proteins  of  the  initial  N-end  rule  study'  (Fig.  2)  were 
the  first  such  reporters.  Over  the  last  decade,  other  protein  reporters, 
including  those  described  above,  were  metabolically  destabilized  by  ex¬ 
tending  them  with  either  a  portable  N-degron  or  a  “nonremovable”  Ub 
moiety.'*®"'”  The  latter  is  targeted  by  a  distinct  Ub-dependent  proteolytic 
pathway  called  the  UFD  pathway  (Ub/fusion/degradation).'  **  These  meta¬ 
bolically  unstable  proteins,  expressed  as  Ub  fusions,  should  be  particularly 
useful  in  settings  where  the  concentration  of  the  reporter  must  reflect  a 
recent  level  of  gene  activity.  Portable  N-degrons  were  also  used  to  destabi¬ 
lize  specific  protein  antigens,  thereby  enhancing  the  presentation  of  their 
peptides  to  the  immune  system.'"'®” 

N-Degron  and  Conditional  Mutants 

A  frequent  problem  with  conditional  phenotypes  is  their  leakiness, 
i.  e.,  unacceptably  high  residual  activity  of  either  a  temperature-sensitive 
(ti)  protein  at  nonpermissive  temperature  or  a  gene  of  interest  in  the  “off” 
sitate  of  its  promoter.  Another  problem  is  “phenotypic  lag,”  which  often 
occurs  between  the  imposition  of  nonpermissive  conditions  and  the  emer¬ 
gence  of  a  relevant  null  phenotype.  Phenotypic  lag  tends  to  be  longer  with 
proteins  that  are  required  in  catalytic  rather  than  stoichiometric  amounts. 

In  one  application  of  Ub  fusions  and  the  N-end  rule  pathway  to  the 
problem  of  phenotypic  lag,  a  constitutive  N-degron  (produced  as  a  Ub 
fusion)  was  linked  to  a  protein  expressed  from  an  inducible  promoter.®'  This 
method  is  constrained  by  the  necessity  of  using  a  heterologous  promoter  and 
by  the  constitutively  short  half-life  of  a  target  protein,  whose  levels  may 
therefore  be  suboptimal  under  permissive  conditions.  An  alternative  ap¬ 
proach  is  to  link  the  N-degron  to  a  normally  long-lived  protein  in  a  strain 
in  which  the  N-end  rule  pathway  can  be  induced  or  repressed.  Such  strains 
have  been  constructed  with  S.  cerevisiae^^'^^  but  can  also  be  designed  in 

C.  K.  Worley,  R.  Ling,  and  J.  Callis,  PUmt  Mol  Hiol  37,  337  (19^W). 

H.  Deichsel,  S.  Friedcl,  A.  Detlerbeck,  ('.  C  oyne,  U.  Ilainkcr,  and  11  K.  MacWilliains, 

Dev.  Genes  Evol  209,  63  (1999). 

I,  Paz,  J.-R.  Meiinier,  and  M.  Choder,  Gene  236,  33  (1999). 

S.  Johnson,  P.  C.  Ma,  1.  M.  Ota,  and  A.  Varshavsky,  J.  Biol.  Chem.  270,  17442  (1995). 

A.  Townsend,  J.  Bastin,  K.  Gould,  G.  Brownlee,  M.  Andrew,  B.  (  oupar,  D.  Boyle.  S.  Chan, 

and  G.  Smith,/,  Exp.  Med.  168,  1211  (1988). 

50  T  Tobery  and  R.  F.  Siliciano,  /.  Immunol.  162,  639  (1999). 

E.  C.  Park,  D.  Finley,  and  J.  W.  Szostak,  Proc.  Natl.  Acad.  Sci.  U.S  A  89,  1249  (1992). 

Z.  Moqtaderi,  Y.  Bai,  D.  Poon,  P.  A.  Weil,  and  K.  Stiuhl,  Nature  {London)  383,  188  (1996). 

M.  Ghislain,  R.  J.  Dohmen,  F.  Levy,  and  A.  Varshavsky,  EM  HO  J.  15,  4884  (1996). 
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Other  species,  including  mammalian  cells.  The  metabolic  stabilities,  and 
hence  also  the  levels  of  N-degron-bearing  proteins,  in  a  cell  with  an  induc¬ 
ible  N-end  rule  pathway  are  either  normal  or  low,  depending  on  whether 
Ubrlp,  the  recognition  (E3)  component  of  the  N-end  rule  pathway,  is 
absent  or  present. These  conditional  mutants  can  be  constructed  with 
any  cytosolic  or  nuclear  protein  whose  function  tolerates  an  N-terminal  ex¬ 
tension. 

Yet  another  design  is  a  portable  N-degron  that  is  inactive  at  a  low 
(permissive)  temperature  but  becomes  active  at  a  high  (nonpermissive) 
temperature.  Such  an  N-degron  was  constructed,  using  the  Ub  fusion  tech¬ 
nique,  from  a  specific  ts  allele  of  the  20-kDa  mouse  dihydrofolate  reductase 
(DHFR)  bearing  the  N-terminal  arginine,  a  strongly  destabilizing  residue.^"* 
Linking  this  DHFR-based,  heat-inducible  N-degron  to  proteins  of  interest 
yielded  a  new  class  of  ts  mutants,  called  td  (temperature-activated  degron). 
The  td  method  does  not  require  an  often  unsuccessful  search  for  a  ts 
mutation  in  a  gene  of  interest.  If  the  corresponding  protein  can  tolerate 
N-terminal  extensions,  the  corresponding  td  fusion  is  functionally  unper¬ 
turbed  at  permissive  temperature.  In  contrast,  low  activity  of  a  ts  protein 
at  permissive  temperature  is  a  frequent  problem  with  conventional  ts  mu¬ 
tants.  The  td  method  eliminates  or  reduces  the  phenotypic  lag,  because  the 
activation  of  N-degron  results  in  rapid  disappearance  of  a  td  protein.  An¬ 
other  advantage  of  the  td  technique  is  the  possibility  of  employing  two  sets  of 
conditions:  a  td  protein-expressing  strain  at  permissive  versus  nonpermissive 
temperature  or,  alternatively,  the  same  strain  versus  a  congenic  strain  lack¬ 
ing  the  N-end  rule  pathway,  with  both  strains  at  nonpermissive  tempera¬ 
ture.^**  This  powerful  internal  control,  provided  in  the  td  technique  by  two 
alternative  sets  of  permissive/nonpermissive  conditions,  is  unavailable  with 
conventional  ts  mutants.  Since  1994,  a  few  laboratories  described  successful 
uses  of  the  td  method  to  construct  ts  alleles  ol  s|)ccilic  proteins  (c.g.,  Refs. 
55  and  56).  A  recent  modification  of  the  fd  technique  combines  the  ga¬ 
lactose-inducible  overexulsion  of  Ubrlp  and  the  temperature-sensitive 
(td)  N-degron. 

N’ Degron  and  Conditional  Toxins 

A  major  limitation  of  the  current  pharmacological  strategies  stems  Iroin 
the  absence  of  drugs  that  are  specific  for  two  or  more  independent  molecular 


R.  J.  I>()hmen,  P.  Wu,  and  A.  Varshavsky,  Science  263.  1273  (I  W) 

G.  Caponigro  and  R.  Parker,  Genes  Dev  9,  2421  (1995) 

Wolf,  M.  Nicks,  S.  Deitz,  E.  van  Tuinen,  and  A.  Franzusoff.  fiiochem.  Hiophys.  Res. 
Commnn.  243,  191  (1998). 

K.  Labib,  J.  A.  'I'ercero,  and  J.  P.  X.  DiUlcy,  Science  288,  1643  (200(1) 
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targets.  For  the  reasons  discussed  in  detail  elsewhere,*’’  it  is  desirable 
to  have  a  therapeutic  agent  that  possesses  a  multitargel,  combinatorial 
selectivity,  which  requires  the  presence  of  two  or  more  predetermined 
targets  in  a  cell  and  simultaneously  the  absence  of  one  or  more  targets 
for  the  drug  to  exert  its  effect.  Note  that  simply  combining  two  or  more 
“conventional”  drugs  against  different  targets  in  a  mullidrug  regimen  would 
not  yield  the  multitarget  selectivity,  because  the  two  drugs  together  would 
perturb  not  only  cells  containing  both  targets  but  also  cells  containing  either 
one  of  the  targets. 

A  strategy  for  designing  protein-based  reagents  that  are  sensitive  to  the 
presence  or  absence  of  more  than  one  target  at  the  same  time  was  proposed 
in  1995.^’  A  key  feature  of  these  reagents  is  their  ability  to  utilize  codomi¬ 
nance,  the  property  characteristic  of  many  signals  in  proteins,  including 
degrons  and  nuclear  localization  signals  (NLSs).  Codominance,  in  this  con- 
“^text,  refers  to  the  ability  of  two  or  more  signals  in  the  same  molecule  to 
function  independently  and  not  to  interfere  with  each  other.  1  he  critical 
property  of  a  degron-based  multitarget  reagent  is  that  its  intrinsic  toxicity 
is  the  same  in  all  cells,  whereas  its  half-life  (and,  consequently,  its  steady 
state  level  and  overall  toxicity)  in  a  cell  depends  on  the  protein  composition 
of  the  cell,  specifically  on  the  presence  of  “target”  proteins  that  have  been 
chosen  to  define  the  profile  of  a  cell  to  be  eliminated.^’  A  related  but 
different  design  involves  a  toxic  protein  made  short-lived  (and  therefore 
relatively  nontoxic)  by  the  presence  of  a  degradation  signal  such  as  an  N- 
degron.  (The  latter  is  produced  by  the  Ub  fusion  technique.)  If  a  cleavage 
site  for  a  specific  viral  processing  protease  is  placed  between  the  toxic 
moiety  of  the  fusion  and  the  N-degron,  the  fusion  would  be  cleaved  in 
virus-infected  cells  but  not  in  uninfected  cells.  As  a  result,  the  toxic  moiety 
of  the  fusion  would  become  long-lived  (and  therelore  more  toxic)  only  in 
virus-infected  cells.^^  The  codominance  concept  and  the  ideas  about  pro¬ 
tein-size  multitarget  reagents  have  been  extended  to  small  (<l-kDa) 
multitarget  compounds.^** 

Overproduction  of  Proteins  as  Ubiqiiitin  Fusions 

A  major  application  of  the  Ub  fusion  technique  is  its  use  to  augment 
the  yields  of  recombinant  proteins.'*’  I’his  approach  increases  the  yield 
of  short  peptides  as  well.'**''"  *’"  The  yield-enhancing  etiect  of  Ub  was  ob- 
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served  not  only  with  eukaryotic  cells  (where  the  Ub  moiety  is  present  m 
a  nascent  fusion  but  not  in  its  mature  counterpart)  but  a  so 
which  lack  the  Ub  system,  including  DUBs,  and  therefoie  retail 
moiety  in  a  translated  fusion.”-^’  {Escherichia  coh  transformed  with  a 
plasmid  expressing  the  S,  eeremwe  DUB  Ubpip  acquires  the  ability 

‘‘‘“Syilld'enhancta°g"effcct  ot  Ub  stems  at  least  in  part  from  rapid 
folding  of  the  nascent  Ub  moiety,  whose  presence  at  the  N  ^ 

emerging  polypeptide  chain  may  thereby  partially  protect  the  sl'H  unfo  ded 
TaiS  aScL  by  proteolytic  pathways  of  the  cytosol  (most  of  these 
piays  are  a  part  of  the  Ub  system).  The  remarkably  strong  tncreases 
h,  protein  yield  even  in  eukaryotic  cells,  where  H-'  Ub  mornty 
is  retained  transiently  (for  it  is  rapidly  removed  by  DUBs),  suggest  that 
this  protection  by  Ub  is  particularly  critical  during  translation,  when 
emeJling  partially  unfolded  polypeptide  chain  may  preset  degrons  that 
me  bur"fd^n  the  folded  version  of  the  same  polypeptide.  The  chaperone 
role  of  Ub  in  this  setting  reflects  one  of  its  physiological  functions  Specih- 
ca  Iv  the  experiments  with  natural  Ub  fusions  containing  ribosomal  proteins 
hS-sSha.  Ihe  iransicn.  presence  of  Ub  °  1“'^ 

moiety  is  required  for  the  efficient  incorporation  of  that  moiety  '"‘o 
nascent  ribosomes,'^  most  likely  because  of  the  transient  protection  effect 

Ub'^ledialed  increase  in  foial  yield  is  oflei,  »“Ompanied  by  an 
even  greater  increase  in  the  solubility  of  overexpressed  protein.  In  this 
regard  the  effect  of  Ub  is  analogous  to  that  of  several  other  proteins,  such 
X>;edoxin«  and  maltose-binding  protein  (MBP).«  When  these  moiet.es 
are  cotranslationally  linked  to  a  protein  of  interest,  ‘  ^ 

yield  and  solubility.  A  model  of  the 

MBP“  may  also  be  relevant  to  the  effect  of  Ub  moiety.  >  a 

partially  unfolded  nascent  protein  is  presumed  to  weakly  tnterac  witl 
nearby  (upstream)  MBP  moiety,  thereby  transiently  precluding  inteimole  - 
..lar  self-interactions  that  could  result  in  irreversible  aggregation  before 
Ihe  protein  has  had  Ihe  time  to  attain  its  mature  conlormation. 

The  first  engineered  Ub  fusions  utilized  pUB23-X,  a 
copy  plasmids  that  expressed  Ub-A'-/3gal  proteins  containing  d.f  e  enl 
iimclioinl  residues  (A')  in  S.  ccrcvisiae  liom  a  galaclose-indutibk.  gliicc  .u 
repressible  promoter.' «  Subsequent  designs  facilitated  the  ^ 

ORFs  encoding  Ub-A’-polypeptide  fusions  by  introducing  a  Audi  (Sst 

J,  W,  Tobias.  T.  E.  Shrader,  (5.  Rocap.  and  A.  Varshavsky,  .SVh'mr  m’  mcCov 

E,  K.  LaValhe,  E.  A.  DiBlasio,  S.  Kovacic,  K.  L.  Oran.,  P.  F.  Schendel,  and  J.  M.  McCoy. 
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site  within  the  codons  for  the  last  three  residues  of  the  Ub  moiety. In 
this  cloning  scheme,  an  ORF  of  interest  is  amplified  by  polymerase  chain 
reaction  (PCR)  and  a  primer  in  which  the  5'  extension  encodes  the  last 
three  residues  of  Ub.  Another  cloning  route  employs  double-stranded  oligo¬ 
nucleotides  with  cohesive  ends  that  are  used  to  join  the  DNA  frag¬ 
ments.^^  The  expression  of  a  resulting  Ub-A"-polypeptide  fusion  in  a  eu¬ 
karyotic  cell  (or  in  a  prokaryotic  cell  that  contains  the  S.  cerevisiae  Ubplp 
DUB)  yields  an  A^-polypeptide  bearing  a  predetermined  N-terminal  resi¬ 
due  X  (Figs.  1  and  2). 

In  their  natural  milieu,  proteins  of  biotechnological  or  pharmacological 
interest  are  often  products  of  the  secretory  pathway,  and  therefore  are 
cleaved  by  signal  peptidase  on  their  entrance  into  the  endoplasmic  reticulum 
(ER).  This  cleavage  frequently  yields  destabilizing  residues  at  the  N-termini 
of  these  proteins.  When  the  same  proteins  are  overexpressed  in  the  cytosol 
of  a  heterologous  bacterial  or  eukaryotic  host,  their  N-terminal  methionine 
tends  to  be  retained,  because  MAPs  cannot  cleave  off  N-terminal  methio¬ 
nine  if  it  is  followed  by  a  destabilizing  residue  (see  above).  It  is  in  these, 
quite  frequent,  cases  that  the  expression  of  a  protein  as  a  Ub-A"-protein 
fusion  attains  two  aims  at  once:  producing  a  protein  of  interest  bearing  the 
desired  N-terminal  residue  (Fig.  2)  and  also,  quite  often,  increasing  the 
yield  of  the  protein,  in  comparison  with  an  otherwise  identical  expression 
of  the  Ub-lacking  protein.^^ 

There  are  numerous  examples  of  Ub-mediated  increases  in  the  yield 
and  solubility  of  overexpressed  proteins.  For  instance,  a  conventional  heter¬ 
ologous  expression  of  the  Strepiomyces  tyrosinase  in  £.  coli  yielded  inactive 
enzyme,  whereas  expression  of  tyrosinase  as  a  Ub  fusion  resulted  in  an 
abundant  and  active  enzyme.^"^  Another  example  of  the  use  of  Ub  fusions 
in  E.  coli  was  an  abundant  expression  of  the  soluble  human  collagenase 
catalytic  domain.  In  contrast,  the  expression  of  the  same  protein  in  the 
absence  of  N-terminal  Ub  moiety  resulted  in  low  yield  and  insoluble  prod- 
uct.^^  A  60-fold  increase  in  the  yield  of  the  human  pi  class  glutathione 
transferase  GSTPl  was  observed  on  the  addition  of  a  Ub-coding  sequence 
to  the  GSTPl  ORF.^^  A  strong  increase  in  protein  yield  in  E.  coli  was 
reported  with  a  combination  of  the  T7  RNA  polymerase  promoter  system 
and  Ub  fusions.^^  Several  other  examples  of  the  Ub  fusion  approach  to 


^  K.  Han,  J.  Hong,  H.  C.  Lim,  C.  H.  Kim,  Y.  Park,  and  J.  M  Cho,  Ann.  N.  Y.  Acad.  Sci.  721, 
30  (1994). 
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protein  overexpression^**-^’"^’  are  described  in  an  earlier  review  by  Baker.-’^ 
More  recently,  Hondred  and  colleagues  applied  the  Ub  fusion  technique 
to  augment  protein  expression  in  transgenic  plants  ’" 


Ubiquitin-Assisted  Dissection  of  Protein  Translocation 
across  Membranes 

A  1994  method  called  UTA  (ubiquitin  translocation  assay)  employs  Ub 
as  a  kinetic  probe  in  the  context  of  signal  sequence-bearing  Ub  fusions.® 
After  emerging  from  ribosomes  in  the  cytosol,  a  protein  may  remain  in 
the  cytosol,  or  may  be  transferred  to  compartments  separated  from  the 
cytosolic  space  by  membranes.  With  a  few  exceptions,  noncytosolic  proteins 
begin  journeys  to  their  respective  compartments  by  crossing  membranes 
that  enclose  intracellular  organelles  such  as  the  ER  and  mitochondria  in 
eukaryotes  or  the  periplasmic  space  in  bacteria.  Amino  acid  sequences  that 
enable  a  protein  to  cross  the  membrane  of  a  compartment  are  often  located 
at  the  protein’s  N  terminus.  These  “signal”  sequences’'  are  targeted  by 
translocation  pathways  specific  for  each  compartment.  The  translocation 
of  a  protein  across  a  compartment  membrane  can  start  before  the  synthesis 
of  the  protein  is  completed,  resulting  in  docking  of  the  still  translating 
ribosome  at  the  transmembrane  channel.  The  UTA  technique  takes  advan¬ 
tage  of  the  rapid  (cotranslational)  cleavage  of  a  Ub  fusion  to  examine 
temporal  aspects  of  protein  transport  across  the  ER  membrane  in  living 
cells.®  Specifically,  if  a  Ub  fusion  that  has  been  engineered  to  bear  an  N- 
terminal  signal  sequence  (SS)  upstream  of  the  Ub  moiety  is  cleaved  in  the 
cytosol  by  DUBs,  the  fusion’s  reporter  moiety  would  fail  to  be  translocated 
into  the  ER.  Conversely,  if  a  nascent  SS  mediates  the  docking  of  a  translat¬ 
ing  ribosome  at  the  transmembrane  channel  rapidly  enough,  or  if  the  fusion 
Ub  moiety  is  located  sufficiently  far  downstream  of  the  SS,  then  by  the 
time  the  Ub  moiety  emerges  from  the  ribosome  the  latter  is  already  docked, 
and  the  nascent  Ub  moiety  enters  the  ER  before  it  can  fold  and/or  be 
targeted  by  DUBs.  Thus,  the  cleavage  at  the  Ub  moiety  of  an  SS-bearing 
Ub  fusion  in  the  cytosol  can  serve  as  an  in  vivo  kinetic  marker  and  a  tool 
for  analyzing  targeting  in  protein  translocation.®  The  temporal  sensitivity 
of  the  UTA  technique  stems  from  rapid  folding  of  the  nascent  Ub  moiety 


’’’  E.  A.  Sabin,  C.  T.  Lee-Ng,  J.  R.  Shuster,  and  P.  J.  Barr,  liio  Technology  7,  705  (l9Sy). 

'■'*  E.  Rian,  R.  Jemtiand,  O.  K.  Olstad,  J.  O.  (iordeladze,  and  K.  M.  Oautvik,  Eur.  J.  Biochem. 
213,  641  (1993). 
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that  precludes  its  translocation  and  makes  it  a  substrate  of  DUBs  in  the 
cytosol  shortly  after  the  emergence  of  the  fusion  Ub  moiety  from  the  ri¬ 
bosome. 


Spllt-Ubiqultln  Sensor  for  Detection  of  Protein-Protein  Interactions 

Another  Ub-based  method,  termed  the  split-Ub  sensor  or  USPS  (Ub/ 
split/protein/sensor),  makes  it  possible  to  detect  and  monitor  a  protein- 
protein  interaction  as  a  function  of  time,  at  the  natural  sites  of  this  interac¬ 
tion  in  a  living  cell7^  These  capabilities  of  the  split-Ub  technique  distinguish 
it  from  the  two-hybrid  assay.^^  The  design  of  a  split-Ub  sensor  is  based  on 
the  following  observations:  when  a  C-terminal  fragment  of  the  76-residue 
Ub  (Cub)  was  expressed  as  a  fusion  to  a  reporter  protein,  the  fusion  was 
cleaved  by  DUBs  only  if  an  N-terminal  fragment  of  Ub  (Nu^)  was  also 
expressed  in  the  same  cell.  This  reconstitution  of  native  Ub  from  its  frag¬ 
ments,  detectable  by  the  in  vivo  cleavage  assay,  was  not  observed  with  a 
mutationally  altered  Nui,,  However,  if  C^.h  and  the  altered  N,,!,  were  each 
linked  to  polypeptides  that  interact  in  vivo,  the  cleavage  of  the  fusion 
containing  Cub  was  restored,  yielding  a  generally  applicable  assay  for  kinetic 
and  equilibrium  aspects  of  the  in  vivo  protein  interactions.^^ 

Enhancement  of  Ub  reconstitution  by  interacting  polypeptides  linked 
to  fragments  of  Ub  stems  from  a  local  increase  in  concentration  of  one  Ub 
fragment  in  the  vicinity  of  the  other.  This  in  turn  increases  the  probability 
that  the  two  Ub  fragments  coalesce  to  form  a  quasinative  Ub  moiety,  whose 
(at  least)  transient  formation  results  in  the  irreversible  cleavage  of  the 
fusion  by  DUBs.  This  cleavage  can  be  detected  readily,  and  can  be  followed 
as  a  function  of  time  or  at  steady  state. Unlike  the  two-hybrid  method, 
which  is  based  on  the  apposition  of  two  structurally  independent  protein 
domains  whose  folding  and  functions  do  not  require  direct  interactions 
between  the  domains,  the  splil-Ub  assay  involves  reconstituting  the  confor¬ 
mation  of  a  small,  single-domain  protein.  Applications  of  the  split-Ub  sensor 
have  shown  that  this  assay  is  capable  of  detecting  transient  in  vivo  interac¬ 
tions  such  as  the  binding  of  a  signal  sequence  of  a  translocated  protein  to 
Sec62p,  a  component  of  the  ER  channel. Different  reporter  readouts  and 
selection-based  screens  have  been  devised  for  the  split-Ub  assay,  making 
it  possible  to  use  this  method  for  identifying  the  in  vivo  ligands  of  a  protein 


N.  Johnsson  and  A.  Varshavsky,  Proc.  Nad.  Acad.  Sci.  U.S.A.  91,  103^40  (1994) 
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of  interest,  similar  to  the  main  application  of  the  two-hybrid  assay.’^  Split- 
protein  sensors  analogous  to  split-Ub  but  employing  other  proteins,  such 
as  DHFR,  have  been  developed  as  well.'^’’'’ 


UPR  Technique 

Direct  measurements  of  the  in  vivo  degradation  of  intracellular  proteins 
require  a  pulse— chase  assay.  It  involves  the  labeling  of  nascent  proteins  for 
a  short  time  with  a  radioactive  precursor  (“pulse”),  the  termination  of 
labeling  through  the  removal  of  radiolabel  and/or  the  addition  of  a  transla¬ 
tion  inhibitor,  and  the  analysis  of  a  labeled  protein  of  interest  at  various 
times  afterward  (“chase”),  using  immunoprecipitation  and  sodium  dodecyl 
sulfate-polyacrylamide  gel  electrophoresis  (SDS-PAGE),  or  analogous 
techniques.  Its  advantage  of  being  direct  notwithstanding,  a  conventional 
pulse-chase  assay  is  fraught  with  sources  of  error.  For  example,  the  immu¬ 
noprecipitation  yields  may  vary  from  sample  to  sample;  the  volumes  of 
samples  loaded  on  a  gel  may  vary  as  well.  If  the  labeling  for  specific 
chase  times  is  done  with  separate  batches  of  cells  (as  is  the  case,  e.g.,  with 
anchorage-dependent  mammalian  cell  cultures),  the  efficiency  of  labeling 
is  yet  another  unstable  parameter  of  the  assay.  As  a  result,  pulse— chase 
data  tend  to  be  semiquantitative  at  best,  lacking  the  means  to  correct  for 
these  errors. 

A  robust  and  convenient  “internal  reference”  strategy  was  described 
in  1996.  This  strategy,  an  extension  of  the  original  Ub  fusion  method, 
was  termed  the  UPR  (ubiquitin/protein/reference)  technique.'''^  UPR  can 
compensate  for  several  sources  of  data  scatter  in  a  pulse-chase  assay 
(Fig.  3).  UPR  employs  a  linear  fusion  in  which  Ub  is  located  between  a 
protein  of  interest  and  a  reference  protein  moiety  (Fig.  3A).  The  fusion  is 
cotranslationally  cleaved  by  DUBs  after  the  last  residue  of  Ub,  producing 
equimolar  amounts  of  the  protein  of  interest  and  the  reference  protein 
bearing  the  C-terminal  Ub  moiety.  If  both  the  reference  protein  and  the 
protein  of  interest  are  immunoprecipitated  in  a  pulse-chase  assay,  the 
relative  amounts  of  the  protein  of  interest  can  be  normalized  against  the 
reference  protein  in  the  same  sample.“-^’'’‘'  The  UPR  technique  (Fig.  3) 

”  1.  Slagljar,  C.  Koroslcnsky,  N.  Joliiissoii,  and  S.  Ic  llecscn,  Proc.  Nall  Acad  Sci.  US  A. 

95.5187  (1998). 

S.  Willke,  N.  Lewke,  S.  Muller,  and  N.  Jolmsson,  Mol.  Biol.  Cell  1(1,  2519  (1999). 

”  1.  Remy  and  S,  W,  Michnick,  Fioc.  Natl.  Acad.  Sci.  U.S.A.  96,  5394  (1999). 

’U.  N.  Pelletier.  F.  X.  Campbell-Valois,  and  S.  W.  Michnick,  Pwc.  Nad.  Acad.  Sci.  U.S.A. 

95.  12141  (1998).  .  r  ,,c  , 

”  F.  Levy.  N.  Johnsson,  T.  Rumenapf,  and  A.  Varshavsky,  Pioc  Nad.  Acad.  Sci.  U  S  A.  73. 

4907  (1996). 
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can  thus  compensate  for  the  scatter  of  immunoprecipitation  yields,  sample 
volumes,  and  other  sources  of  sample-to-sample  variation.  1  he  increased 
accuracy  afforded  by  UPR  underscored  the  insufficiency  of  the  current 
“half-life”  terminology,  because  the  in  vivo  degradation  of  many  pro¬ 
teins  deviates  from  first-order  kinetics.  For  a  discussion  of  this  problem 
and  the  terminology  for  describing  nonexponential  decay,  see  Refs.  29 

and  79. 


Ublqultin  Sandwich  Technique 

Nascent  polypeptides  emerging  from  the  ribosome  may,  in  the  process 
of  folding,  present  degradation  signals  similar  to  those  recognized  by  the 
Ub  system  in  misfolded  or  otherwise  damaged  proteins.  It  has  been  a  long¬ 
standing  question  whether  a  significant  fraction  of  nascent  polypeptides  is 
cdtranslationally  degraded.  Determining  whether  nascent  polypeptides  are 
actually  degraded  in  vivo  has  been  difficult  because  at  any  given  time  the 
nascent  chains  of  a  particular  protein  species  are  of  different  sizes,  and 
therefore  would  not  form  a  band  on  electrophoresis  in  a  conventional 
pulse-chase  assay.  The  Ub  sandwich  technique’  makes  it  possible  to  detect 
cotranslational  protein  degradation  by  measuring  the  steady  state  ratio  of 
two  reporter  proteins  whose  relative  abundance  is  established  cotransla- 

tionally.  „  .  . 

Operationally,  the  Ub  sandwich  technique  is  a  three-protein  version 

of  the  UPR  assay.^’  A  polypeptide  to  be  examined  for  cotranslational 
degradation,  termed  B,  is  sandwiched  between  two  stable  reporter  domains 
A  and  C  in  a  linear  fusion  protein.  The  three  polypeptides  are  connected 
via  Ub  moieties  to  create  a  fusion  protein  of  the  form  AUb-BUb-CUb. 
The  independent  polypeptides  AUb,  BUb,  and  CUb  that  result  from  the 
cotranslational  cleavage  of  AUb-BUb-CUb  by  DUBs  are  called  modules. 
The  DUB-mediated  cleavage  establishes  a  kinetic  competition  between 
two  mutually  exclusive  events  during  the  synthesis  of  the  AUb-BUb-CUb 
fusion:  cotranslational  UBP  cleavage  at  the  BUb-CUb  junction  to  release 
the  long-lived  CUb  module  or,  alternatively,  cotranslational  degradation 
of  the  entire  BUb-CUb  nascent  chain  by  the  26S  proteasome.  In  the  latter 
case,  the  processivity  of  proteasome-mediated  degradation  results  in  the 
destruction  of  the  Ub  moiety  between  B  and  C  before  it  can  be  recognized 
by  UBPs.  The  resulting  drop  in  levels  of  the  CUb  module  relative  to  levels 
of  AUb,  referred  to  as  the  C/A  ratio,  reflects  the  cotranslational  degradation 
of  domain  B.  This  measurement  provides  a  minimal  estimate  of  the  total 
amount  of  cotranslational  degradation,  because  nonprocessive  cotransla- 
tional  degradation  events  that  do  not  extend  into  the  C  domain  are  not 
detected.  The  Ub  sandwich  method  was  used  to  demonstrate  that  more  than 
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50%  of  nascent  protein  molecules  bearing  an  N-degron  can  be  degraded 
cotranslationally  in  5.  cerevisiae,  never  reaching  their  mature  size  before 
their  destruction  by  processive  proteolysis.^ 

If  cotranslational  protein  degradation  by  the  Ub  system  is  found  to  be 
extensive  for  at  least  some  wild-type  proteins  (surveys  of  natural  proteins 
remain  to  be  carried  out  by  this  new  technique),  it  could  be  accounted 
for  as  an  evolutionary  trade-off  between  the  necessity  of  identifying  and 
destroying  degron-bearing  mature  proteins  and  the  mechanistic  difficulty 
of  distinguishing  between  posttranslationally  and  cotranslationally  pre¬ 
sented  degrons,  Cotranslational  protein  degradation  may  also  represent  a 
previously  unrecognized  form  of  protein  quality  control,  which  destroys 
nascent  chains  that  fail  to  fold  correctly.  These  and  other  questions  about 
physiological  aspects  of  the  cotranslational  protein  degradation  can  now 
be  addressed  directly  in  living  cells  through  the  Ub  sandwich  tech¬ 
nique,^ 

Concluding  Remarks 

The  Ub  fusion  technique  is  made  possible  by  the  ability  of  DUBs  to 
cleave  a  Ub  fusion  in  vivo  or  in  vitro  after  the  last  residue  of  Ub  irrespective 
of  the  flanking  sequence  context.  Since  its  development,  the  Ub  fusion 
technique  has  given  rise  to  a  number  of  applications  whose  common  feature 
is  utilization  of  the  rapid  and  highly  specific  cleavage  of  a  Ub-containing 
fusion  by  DUBs.  Among  these  applications  is  the  UPR  technique,  which 
increases  the  accuracy  of  pulse-chase  and  analogous  measurements. !  hope 
that  the  use  of  UPR  will  spread,  supplanting  the  conventional,  far  less 
accurate  pulse-chase  protocols  that  lack  a  reference  protein.  The  Ub  sand¬ 
wich  technique,  a  descendant  of  UPR,  has  made  it  possible  to  determine 
the  extent  of  cotranslational  protein  degradation  in  vivo  for  any  protein 
of  interest.  One  important  feature  of  the  Ub  moiety  is  its  ability,  as  a  part 
of  linear  fusions,  to  increase  the  yields  and  solubility  of  overexpressed 
proteins  or  short  peptides  in  either  eukaryotic  or  bacterial  hosts.  In  yet 
another  class  of  Ub-based  applications,  the  demonstrated  coalescence  of 
peptide-size  Ub  fragments  into  a  quasinalive  Ub  fold  has  yielded  the  split- 
Ub  sensor  for  detecting  protein  interactions  in  vivo.  Ub  fusions  continue 
to  be  useful  in  a  remarkable  variety  of  ways. 
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We  dissected  physical  and  functional  interactions 
between  the  ubiquitin-conjugating  (E2)  enzyme  Ubc2p 
and  Ubrlp,  the  E3  component  of  the  N-end  rule 
pathway  in  Saccharomyces  cerevisiae.  The  binding  of 
the  20  kDa  Ubc2p  by  the  225  kDa  Ubrlp  is  shown  to 
be  mediated  largely  by  the  basic  residue-rich  (BRR) 
region  of  Ubrlp.  However,  mutations  of  the  BRR 
domain  that  strongly  decrease  the  interaction  between 
Ubrlp  and  Ubc2p  do  not  prevent  the  degradation  of 
N-end  rule  substrates.  In  contrast,  this  degradation  is 
completely  dependent  on  the  RING-H2  finger  of  Ubrlp 
adjacent  to  the  BRR  domain.  Specifically,  the  first 
cysteine  of  RING-H2  is  required  for  the  ubiquitylation 
activity  of  the  Ubrlp-Ubc2p  complex,  although  this 
cysteine  plays  no  detectable  role  in  either  the  binding 
of  N-end  rule  substrates  by  Ubrlp  or  the  physical 
affinity  between  Ubrlp  and  Ubc2p.  These  results 
defined  the  topography  of  the  Ubc2p-Ubrlp  interaction 
and  revealed  the  essential  function  of  the  RING-H2 
finger,  a  domain  that  is  present  in  many  otherwise 
dissimilar  E3  proteins  of  the  ubiquitin  system. 
Keywords:  E2/E3/N-end  rule/proteasome/RING  finger/ 
ubiquitin 


Introduction 

Ubiquitin  (Ub)  is  a  16  residue  protein  whose  covalent 
conjugation  to  other  proteins,  usually  in  the  form  of 
a  multi-Ub  chain,  marks  these  proteins  for  processive 
degradation  by  the  26S  proteasome,  an  ATP-dependent 
multisubunit  protease  (Hochstrasser,  1996;  Varshavsky, 
1997;  Baumeister  et  al,  1998;  Hershko  and  Ciechanover, 
1998;  Scheffner  et  ai,  1998).  Through  either  constitutive 
or  conditional  degradation  of  many  intracellular  proteins, 
the  Ub-dependent  proteolytic  pathways  regulate  a  multi¬ 
tude  of  biological  processes,  including  the  cell  cycle,  cell 
differentiation,  apoptosis,  DNA  transcription,  replication 
and  repair,  signal  transduction,  functions  of  the  nervous 
system  and  stress  responses,  including  the  immune 
response  (Hicke,  1997;  Varshavsky,  1997;  Peters  et  aL, 
1998). 

The  conjugation  of  Ub  to  other  proteins  involves  the 
formation  of  a  thioester  between  the  C-terminus  of  Ub 
and  a  specific  cysteine  of  the  Ub-activating  (El)  enzyme. 
The  Ub  moiety  of  El~Ub  thioester  is  thereafter  transesteri- 
fied  to  a  specific  cysteine  in  one  of  several  Ub-conjugating 
(E2)  enzymes.  The  Ub  moiety  of  E2'-Ub  thioester  is 


conjugated,  via  the  isopeptide  bond,  to  the  8-amino  group 
of  either  a  substrate’s  Lys  residue  or  a  Lys  residue  of 
another  Ub  moiety,  the  latter  reaction  resulting  in  a 
substrate-linked  multi-Ub  chain  (Chau  et  al,  1989;  Pickart, 
1997;  Scheffner  et  al,  1998).  Most  E2  enzymes  function 
in  complexes  with  proteins  called  E3s.  The  functions  of 
E3s  include  the  initial  recognition  of  degradation  signals 
(degrons)  in  the  substrate  proteins,  with  different  E3s 
recognizing  different  classes  of  degrons.  At  least  some 
E3s,  specifically  those  containing  the  HECT  domain, 
accept  the  Ub  moiety  from  the  associated  E2~Ub  thioester, 
forming  an  E3~Ub  thioester  and  acting  as  a  proximal 
donor  of  the  Ub  moiety  to  substrates  selected  by  the  E3 
(Scheffner  et  al,  1995;  Nuber  and  Scheffner,  1999;  Wang 
et  al,  1999). 

One  pathway  of  the  Ub  system  is  the  N-end  rule  pathway 
(Bachmair  et  al,  1986;  Varshavsky,  1996).  Among  the 
targets  of  this  pathway  are  proteins  bearing  destabilizing 
N-terminal  residues.  In  the  yeast  Saccharomyces  cerevis- 
iae,  Asn  and  Gin  are  tertiary  destabilizing  N-terminal 
residues,  in  that  they  function  through  their  conversion,  by 
the  V7A7-encoded  N-terminal  amidase,  into  the  secondary 
destabilizing  N-terminal  residues  Asp  and  Glu.  The  desta¬ 
bilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  ATEi -encoded  Arg-tRNA-protein 
transferase,  to  Arg,  one  of  the  primary  destabilizing 
residues  (Balzi  et  al,  1990;  Baker  and  Varshavsky,  1995). 
In  mammals,  the  deamidation  step  is  bifurcated,  in  that 
two  distinct  amidases  specific,  respectively,  for  N-terminal 
Asn  or  Gin,  mediate  the  activity  of  tertiary  destabilizing 
residues  (Stewart  et  al,  1995;  Grigoryev  et  al,  1996). 
The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  U5R7 -encoded  E3  (N-recognin),  the  recog¬ 
nition  component  of  the  N-end  rule  pathway.  In  S.cerevis- 
iae,  Ubrlp  is  a  225  kDa  protein  that  binds  to  potential 
N-end  rule  substrates  through  their  primary  destabilizing 
N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  He,  Arg,  Lys  and 
His  (Bartel  et  al,  1990).  Ubrlp  contains  at  least  three 
substrate-binding  sites.  The  type  1  site  is  specific  for  basic 
N-terminal  residues  Arg,  Lys  and  His.  The  type  2  site  is 
specific  for  the  hydrophobic  residues  Phe,  Leu,  Trp,  Tyr 
and  He  (Reiss  et  al,  1988;  Gonda  et  al,  1989;  Baker  and 
Varshavsky,  1991).  Ubrlp  also  targets  short-lived  proteins 
such  as  Cup9p  and  Gpalp,  which  lack  destabilizing 
N-terminal  residues,  and  are  recognized  by  Ubrlp  through 
its  third  substrate-binding  site,  the  exact  location  and 
specificity  of  which  remain  to  be  determined  (Madura  and 
Varshavsky,  1994;  Byrd  et  al,  1998).  Similar  but  distinct 
versions  of  the  N-end  rule  pathway  are  present  in  all 
organisms  examined,  from  bacteria  to  mammals.  For  a 
summary  of  the  currently  known  functions  of  this  pathway, 
see  Kwon  et  al  (1999a,b). 

The  roles  of  E3  proteins  in  the  E2,  E3 -dependent 
ubiquitylation  of  substrates  are  not  understood  in  detail. 
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RING-H2  finger  and  E3  function 


It  has  been  shown  that  the  UBRl  -mcodcd  N-recognin, 
the  E3  of  the  N-end  rule  pathway,  functions  in  a  complex 
with  Ubc2p  (Rad6p)  (Madura  et  al,  1993;  Watkins  et  aL, 
1993),  one  of  13  E2  enzymes  in  S.cerevisiae  (Hochstrasser, 
1996).  The  poly  acidic  C- terminal  tail  of  Ubc2p  was  found 
to  be  essential  for  the  binding  of  Ubc2p  to  Ubrlp,  but  the 
Ubc2p-binding  site  of  Ubrlp  remained  unknown.  The 
mechanism  of  cooperation  between  E3  (Ubrlp)  and  E2 
(Ubc2p)  in  forming  a  substrate-linked  multi-Ub  chain  is 
unknown  for  either  this  or  other  Ub-dependent  pathways. 

We  report  two  main  results,  (i)  The  Ubrlp-Ubc2p 
interaction  is  mediated  by  the  basic  residue-rich  (BRR) 
region  of  Ubrlp  and  the  polyacidic  tail  of  Ubc2p.  Interes¬ 
tingly,  this  high-affinity  interaction  is  not  essential  for  the 
activity  of  the  N-end  rule  pathway,  (ii)  The  RING-H2 
domain  of  Ubrlp,  adjacent  to  the  BRR  region,  is  strictly 
required  for  ubiquitylation  of  N-end  rule  substrates,  even 
though  this  domain  is  not  required  for  the  binding  of 
Ubrlp  to  either  Ubc2p  or  N-end  rule  substrates.  Because 
the  RING-H2  and  similar  RING  variants  are  a  character¬ 
istic  feature  of  many  otherwise  dissimilar  E3  proteins,  the 
ubiquity lation-enabling  function  of  the  Ubrlp  RING-H2 
identified  in  the  present  work  is  likely  to  be  a  general 
property  of  E3  components  in  the  Ub  system. 

Terminology 

Ubiquitin  whose  C-terminal  (Gly76)  carboxyl  group  is 
covalently  linked  to  another  compound  is  called  the 
ubiquity  I  moiety,  the  derivative  terms  being  ubiquitylation 
and  ubiquitylated.  The  term  Ub  refers  to  both  free  ubiquitin 
and  the  ubiquityl  moiety.  This  nomenclature,  which  is 
also  recommended  by  the  Nomenclature  Committee  of 
the  International  Union  of  Biochemistry  and  Molecular 
Biology,  brings  Ub-related  terms  in  line  with  the  standard 
chemical  terminology  (Varshavsky,  1997). 

Results 

Dissection  of  the  Ubr1p-Ubc2p  interaction 

To  search  for  Ubc2p-interacting  proteins,  we  employed 
the  two-hybrid  technique  (Fields  and  Song,  1989;  James 
et  aL,  1996),  using  the  20  kDa  Ubc2p  as  bait.  A  plasmid 
thus  identified  contained  a  fragment  of  the  S.cerevisiae 
open  reading  frame  (ORF)  YLR024C,  which  was  previ¬ 
ously  termed  UBRl  (Hochstrasser,  1996;  Kwon  et  aL, 
1998),  because  the  predicted  sequence  of  the  216  kDa 
Ubr2p  was  22%  identical  and  46%  similar  to  that  of  the 
225  roa,  USRi -encoded  N-recognin,  the  E3  of  the  N-end 
rule  pathway  (Bartel  et  aL,  1990).  Ubr2p  is  not  essential 
for  cell  viability  under  standard  growth  conditions  and 
does  not  target  N-end  rule  substrates,  but  does  compete 
with  Ubrlp  for  the  binding  to  Ubc2p  (H.Rao,  Y.Xie  and 
A.Varshavsky,  unpublished  data). 

The  cloned  Ubc2p-interacting  region  of  Ubr2p  was  a 
287  residue  fragment  (positions  1134-1420).  Sequence 
comparisons  revealed  two  distinct  motifs  conserved 
between  Ubrlp  and  Ubr2p  in  this  region  (Figure  1  A).  One 
was  a  basic  residue-rich  region,  termed  BRR,  and  the 
other  a  RING-H2  finger  (Saurin  et  aL,  1996;  Kwon  et  aL, 
1998).  The  RING-H2  motif  is  a  distinct  variant  of  the 
previously  defined  Cys/His-rich  RING  motif,  which  is 
thought  to  be  involved  in  protein-protein  interactions 
(Borden  and  Freemont,  1996;  Inouye  et  aL,  1997).  Two- 


hybrid  analyses  indicated  that  Ubc2p  interacted  with  the 
Ubr2p-homologous  region  of  Ubrlp  as  well  (positions 
1081-1367)  (Figure  1C  and  D).  In  addition,  two-hybrid 
assays  with  a  series  of  Ubrlp  fragments  fused  to  the 
Gal4p  transcriptional  activation  domain  demonstrated  that 
the  287  residue  region  was  the  only  detectable  Ubc2p- 
binding  site  in  Ubrlp  (data  not  shown).  We  used  site- 
directed  mutagenesis  to  dissect  the  Ubc2p-Ubrlp  inter¬ 
action  in  this  region  of  Ubrlp.  Two  sets  of  mutants  were 
constructed  (Figure  IB).  The  mutants  of  one  set,  termed 
MB  (mutation  in  BRR),  bore  a  missense  mutation(s)  in 
the  BRR  domain;  the  mutants  of  another  set,  termed  MR 
(mutation  in  RING-H2)  bore  either  Cys— >Ser  or  His^Ala 
mutations  in  the  RING-H2  finger.  These  derivatives  of 
the  287  residue  region  of  Ubrlp  were  tested  for  their 
binding  to  Ubc2p  using  a  version  of  the  two-hybrid  assay 
that  utilized  two  reporter  genes,  ADE2  and  HISS  (James 
et  aL,  1996).  This  design  increased  the  assay’s  affinity 
range,  because  a  strong  two-hybrid  interaction  was 
required  to  confer  on  the  tester  cells  the  Ade"^  phenotype, 
whereas  a  weak  two-hybrid  interaction  sufficed  to  make 
the  same  tester  cells  His^  (James  et  aL,  1996). 

The  replacement  of  a  single  basic  residue  at  various 
positions  in  the  BRR  domain  of  Ubrlp  with  uncharged 
residues  (e.g.  the  alleles  MBIO,  MBll  and  MB  17)  sig¬ 
nificantly  weakened  the  interaction  of  Ubc2p  with  the 
287  residue  fragment  of  Ubrlp,  as  indicated  by  the  Ade" 
His"^  phenotype  of  the  tester  strain,  in  comparison  with 
the  Ade^  His"^  phenotype  observed  with  the  fragment’s 
wild-type  (wt)  version  (Figure  1C).  When  two  basic 
residues  of  the  BRR  domain  were  converted  to  uncharged 
residues  (MB4  allele),  the  resulting  tester  strain  was  still 
His"^,  but  grew  more  slowly  on  histidine-lacking  medium 
than  did  the  tester  strain  bearing  single-substitution  Ubrlp 
mutants  (Figure  1C;  data  not  shown),  suggesting  a  further 
weakening  of  the  Ubc2p-Ubrlp  interactions.  With  triple¬ 
substitution  mutants  in  the  BRR  domain  of  Ubrlp  (MB5 
and  MB6),  the  affinity  of  Ubc2p  for  the  287  residue  region 
of  Ubrlp  decreased  to  levels  undetectable  with  the  two- 
hybrid  assay  (Figure  1C). 

In  contrast  to  these  findings  with  the  BRR  domain, 
none  of  the  four  single  Cys— >Ser  substitutions  in  the 
adjacent  RING-H2  finger  of  Ubrlp  impaired  the  interaction 
of  Ubc2p  with  the  287  residue  region  of  Ubrlp  (Figure  1C, 
mutants  MR1-MR3  and  MR18).  A  His^Ala  mutation  in 
the  RING-H2  finger  was  also  without  effect  (Figure  1C, 
mutant  MR  12).  Moreover,  even  the  simultaneous 
Cys^Ser  alterations  at  two  positions  of  the  RING-H2 
finger  failed  to  affect  the  Ade'^  His^  phenotype  of  the 
tester  strain  (Figure  1C,  mutants  MR7  and  MR8).  The 
287  residue  region  of  Ubrlp  was  then  subcloned  into  two 
fusions,  which  contained,  respectively,  the  BRR  domain 
alone  (positions  1081-1220)  and  the  RING-H2  finger 
alone  (positions  1177-1367).  The  RING-H2  finger  alone 
did  not  exhibit  affinity  to  Ubc2p  (Figure  ID).  In  contrast, 
the  binding  of  Ubc2p  to  the  BRR  domain  alone  was 
readily  detectable,  but  this  interaction  was  weakened  in 
comparison  with  the  interaction  between  Ubc2p  and  the 
287  residue  region  of  Ubrlp  containing  both  BRR  and 
RING-H2  (Figure  ID). 

Taken  together,  these  results  indicated  that  the  positively 
charged  BRR  domain  is  the  main  region  mediating  the 
binding  of  Ubrlp  to  Ubc2p,  through  largely  electrostatic 
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Fig.  1.  Ubc2p  binds  to  the  BRR  domain  of  Ubrlp.  (A)  Sequence  comparison  of  the  Ubc2p-binding  domains  of  S.cerevisiae  Ubrlp  and  Ubr2p. 
Identical  residues  are  highlighted  in  black;  similar  residues  are  shaded.  The  conserved  and  similar  basic  residues  are  marked  by  ‘  +  ’.  The  Cys  and 
His  residues  of  the  RING-H2  motif  are  denoted  by  asterisks.  (B)  Schematic  representation  of  mutations  in  the  BRR  and  RING-H2  domains  of 
Ubrlp.  The  287  residue  region  of  the  1950  residue  Ubrlp  (positions  1081-1367)  was  divided  into  two  regions  containing  the  BRR  domain  and  the 
RING-H2  finger.  Mutations  of  basic  residues,  termed  the  MB  mutations,  in  the  BRR  domain  were  at  positions  1165-1175  and  are  underlined  in  bold. 
Mutations  of  either  Cys  or  His  residues  in  the  RING-H2  finger,  termed  the  MR  mutations,  were  at  positions  1220-1323  and  are  underlined  in  bold. 
The  positions  of  mutated  Cys  and  His  residues  within  RING-H2  are  listed  in  parentheses.  (C)  Summary  of  the  two-reporter  {HIS3  and  ADE2) 
two-hybrid  results  with  the  MB  and  MR  mutants  of  Ubrlp  versus  wt  Ubc2p.  See  the  text  and  Materials  and  methods  for  details.  (D)  wt  Ubc2p  and 
its  two  truncated  alleles,  Ubc2pi_i49  and  Ubc2pio_i72>  were  assayed  in  the  two-reporter/two-hybrid  system  for  their  interaction  with  either  the 
287  residue  Ubrlp  fragment,  or  its  BRR  domain  alone,  or  its  RING-H2  domain  alone.  The  9  residue  N-terminal  region  and  the  polyacidic 
C-terminal  tail  of  Ubc2p  are  denoted  by  the  black  and  hatched  boxes,  respectively. 


interactions  of  the  BRR  region  with  the  polyacidic 
C-terminal  tail  of  Ubc2p  (see  below).  These  findings  also 
indicated  that  the  presence  of  the  RING-H2  finger  near 
the  BRR  domain  was  not  essential  for  the  binding  of 
Ubc2p  but  was  required  (in  an  undefined  way;  but  see 
below)  for  the  high-affinity  Ubrlp-Ubc2p  interaction 
(Figure  IB-D).  In  addition,  either  single  or  double 
Cys^Ser  substitutions  in  the  RING-H2  finger  did  not 
appear  to  result  in  a  gross  conformational  alteration  of 
the  BRR/RING-H2  region  of  Ubrlp,  as  indicated  by  the 
undiminished  affinity  of  mutants  in  this  region  for  Ubc2p 
(Figure  1C);  further  evidence  for  this  is  described  below. 

Previous  work  has  shown  that  both  the  N-terminal 
9  residue  region  and  the  23  residue  polyacidic  C-terminal 
tail  of  Ubc2p  are  required  for  the  binding  of  Ubc2p  to 
Ubrlp  (Madura  et  al,  1993;  Watkins  et  al,  1993).  To 
determine  the  relative  contributions  of  these  Ubc2p  regions 
to  the  interaction  between  Ubc2p  and  the  BRR  domain  of 


Ubrlp,  we  constructed  two  hybrid  fusions  containing  two 
truncated  alleles  of  UBC2  which  encoded,  respectively, 
Ubc2pio_n2  (lacking  residues  1-9)  and  Ubc2pi_i49  (lacking 
the  polyacidic  tail).  These  derivatives  of  Ubc2p  were 
tested  in  the  two-hybrid  assay  for  binding  to  the  entire 
287  residue  region  of  Ubrlp,  or  to  the  BRR  domain  alone, 
or  to  the  RING-H2  finger  alone.  The  deletion  of  the 
polyacidic  C-terminal  tail  of  Ubc2p  abolished  its  inter¬ 
action  with  the  BRR  domain  of  Ubrlp,  whereas  the 
deletion  of  the  first  9  residues  of  Ubc2p  weakened  but 
did  not  abolish  this  interaction  (Figure  ID).  Interestingly, 
the  latter  interaction,  weakened  but  readily  detectable,  was 
independent  of  the  presence  of  the  RING-H2  finger 
(Figure  ID).  Thus,  the  bulk  of  the  affinity  of  Ubc2p  for 
the  287  residue  fragment  of  Ubrlp  stems  from  electrostatic 
interactions  between  the  polyacidic  C-terminal  tail  of 
Ubc2p  and  the  positively  charged,  --140  residue  BRR 
domain  of  the  1950  residue  Ubrlp.  (The  size  of  a 
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Fig.  2.  The  BRR  domain  of  Ubrlp  is  essential  for  high-affinity 
binding  to  Ubc2p.  (A)  GST  pull-down  assays  with  wt  Ubrlp  and  its 
mutants.  FLAG-Ubrlp  (denoted  as  wt  UBRl)  (lane  2),  or  FUAG- 
Ubrlp*^'^'  (lane  3)  or  FLAG-Ubrlp'^'^^  (lane  4),  or  FLAG-Ubrlp“'** 
(lane  5)  were  overexpressed  in  AVY301  (ubrlA  ubc2A)  cells  (see 
Materials  and  methods).  Cells  were  labeled  with  [^^S] methionine  for 
30  min  at  30°C,  and  the  extracts  were  assayed  using  glutathione- 
agarose  beads  bound  with  GST-Ubc2p  (top  panel)  or  applied  to 
immunoprecipitation  with  anti-FLAG  monoclonal  antibody  (lower 
panel).  (B)  In  vivo  association  of  Ubc2p  with  wt  Ubrlp,  Ubrlp^^^ 
and  Ubrlp^^^,  but  not  with  Ubrlp^®^.  Cell  extracts  from 
transformants  expressing  FLAG-Ubrlp  alone  (lane  1)  or  the 
N-terminally  HA-tagged  Ubc2p  (HA-Ubc2p)  alone  (lane  2),  or 
co-expressing  HA-Ubc2p  with  either  FLAG-Ubrlp  (lane  3)  or 
FLAG-Ubrlp^^^  (lane  4),  FLAG-Ubrlp^^^  (lane  5)  or  FLAG- 
Ubrlp^*^^  (lane  6)  were  incubated  with  anti-HA  antibody  (see 
Materials  and  methods).  Co-precipitated  wt  Ubrlp  and  its  mutant 
derivatives  were  resolved  by  SDS-PAGE,  followed  by  immunoblotting 
with  anti-FLAG  antibody  (top  panel).  The  asterisk  marks  a  cross¬ 
reactive  band.  The  expression  levels  of  Ubrlp  and  Ubc2p  in  these 
transformants  were  examined  by  immunoblotting  with,  respectively, 
anti -FLAG  (middle  panel)  and  anti-HA  antibodies  (bottom  panel). 
SDS-PAGE  was  carried  out  in  8%  gels. 


contiguous  region  within  the  BRR  domain  that  is  required 
for  the  binding  of  Ubc2p  is  likely  to  be  significantly  less 
than  140  residues.) 

To  determine  whether  basic  residues  of  the  BRR  domain 
and  the  RING-H2  finger  were  essential  for  the  binding  of 
the  full-length  Ubrlp  to  Ubc2p,  we  first  employed  a 
pull-down  assay  with  a  fusion  of  Ubc2p  to  glutathione 
5- transferase  (GST).  Specifically,  wt  FLAG-Ubrlp  (tagged 
at  the  N-terminus  with  the  FLAG  epitope),  or  FLAG- 
Ubrlp^®^  (a  triple-substitution  mutant  in  the  BRR  domain; 
Figure  IB),  or  FLAG-Ubrlp'^^*  or  FLAG-Ubrlp’^’^^  (a 
single  Cys— >Ser  substitution  at  Cysl  and  Cys5,  respect¬ 
ively;  Figure  IB)  was  overexpressed  in  a  ubrliA  ubc2A 
S.cerevisiae  strain  and  labeled  with  [^^Sjmethionine.  The 
N-terminal  FLAG  epitope  did  not  impair  the  function  of 
Ubrlp  (A.Webster  and  A.Varshavsky,  unpublished  data). 
The  extracts  from  labeled,  Ubrlp-overexpressing  cells 
(and  the  extract  from  cells  carrying  vector  alone)  were 
incubated  with  glutathione-agarose  beads  loaded  with 
GST-Ubc2p,  and  the  amounts  of  ^^S -labeled  FLAG- 
Ubrlp  and  Ubrlp  mutants  recovered  from  the  beads  were 
determined  by  SDS-PAGE  (Figure  2A,  top  panel).  In 
parallel,  the  same  extracts  were  immunoprecipitated  with 
a  monoclonal  anti-FLAG  antibody,  to  verify  equality  of 
the  initial  inputs  of  FLAG-Ubrlp  and  its  mutants 
(Figure  2A,  lower  panel).  In  agreement  with  the  results 
of  the  two-hybrid  assays  (Figure  IB  and  C),  the  binding 


of  FLAG-Ubrlp^®^  to  Ubc2p  was  barely  detectable 
(Figure  2A,  top  panel,  lane  5),  whereas  the  binding  of 
either  FLAG-Ubrlp^®^  or  FLAG-Ubrlp^®^  to  Ubc2p  was 
comparable  to  that  of  wt  FLAG-Ubrlp  (Figure  2A,  top 
panel,  lanes  2-4). 

We  also  examined  the  in  vivo  association  of  wt  Ubrlp 
and  its  mutants  with  Ubc2p,  using  co-immunoprecipi- 
tation-immunoblot  assays.  Either  wt  FLAG-Ubrlp, 
FLAG-UbrlpM®^  FLAG-Ubrlp^®^  or  FLAG-Ubrlp^®^ 
was  co-overexpressed  in  a  ubrlA  ubc2A  S.cerevisiae 
with  Ubc2p  bearing  the  N-terminal  hemagglutinin  (HA) 
epitope.  Controls  included  congenic  cells  expressing 
FLAG-Ubrlp  alone  or  Ubc2p  alone.  Proteins  were 
immunoprecipitated  from  cell  extracts  with  anti-HA  anti¬ 
body,  followed  by  SDS-PAGE  and  immunoblotting  with 
anti-FLAG  antibody  (Figure  2B,  top  panel).  The  expres¬ 
sion  levels  of  Ubrlp  and  Ubc2p  in  these  transformants 
were  monitored  by  immunoblotting  the  initial  extracts 
with  either  anti-FLAG  antibody  (Figure  2B,  middle  panel) 
or  anti-HA  antibody  (Figure  2B,  bottom  panel).  In  agree¬ 
ment  with  the  results  of  two-hybrid  and  pull-down  assays, 
the  in  vivo  association  of  FLAG-Ubrlp^®^  with  Ubc2p 
was  undetectable  (lane  4),  whereas  both  FLAG-Ubrlp^®^ 
and  FLAG-Ubrlp^®^  bound  Ubc2p  as  efficiently  as  wt 
FLAG-Ubrlp  (lanes  3,  5  and  6).  Thus,  the  BRR  domain 
of  Ubrlp  is  essential  for  the  binding  of  Ubrlp  to  Ubc2p, 
whereas  the  adjacent  RING-H2  finger  of  Ubrlp  is  not 
directly  involved  (Figures  IB,  C  and  2). 

High  affinity  between  Ubrlp  and  Ubc2p  is  not 
essential  for  degradation  of  N-end  rule  substrates 

We  asked  whether  the  MB-type  point  mutations  in  the 
BRR  domain  of  Ubrlp  that  virtually  abolished  the  binding 
of  Ubrlp  to  wt  Ubc2p  (Figures  IB,  C  and  2)  affected  the 
activity  of  the  N-end  rule  pathway,  wt  Ubrlp  and  the 
BRR-domain  mutants  Ubrlp^®"^  and  Ubrlp^®^  (Figure  1C 
and  D)  were  expressed  in  ubrlA  cells  from  a  low-copy 
plasmid  and  the  native  Pubri  promoter.  The  ubrlA  cells 
also  expressed  an  N-end  rule  substrate,  either  Arg-^gal  or 
Leu-pgal,  which  are  short-lived  in  UBRl  cells  (?i/2  of  ~2 
and  ~3  min,  respectively)  (Varshavsky,  1996).  Arg-pgal 
and  Leu-Pgal  were  expressed  as  Ub-Arg-Pgal  and  Ub- 
Leu-pgal,  which  were  co-translationally  deubiquitylated 
in  vivo  (Varshavsky,  1996).  Previous  work  has  also  shown 
that  the  enzymatic  activity  of  an  N-end  rule  substrate 
such  as  X-Pgal  is  a  sensitive  measure  of  the  substrate’s 
metabolic  stability  in  vivo  (Madura  et  ah,  1993;  Kwon 
et  ah,  1999a). 

Both  Arg-pgal  and  Leu-pgal  were  long-lived  in  vivo  in 
the  absence  of  Ubrlp  (ti/2  >20  h)  (‘vector’  bars  in 
Figure  3A;  data  not  shown).  As  expected,  Arg-Pgal  and 
Leu-pgal  were  degraded  in  the  presence  of  wt  Ubrlp 
(Figure  3A).  Surprisingly,  the  steady-state  levels  of  Arg- 
pgal  and  Leu-Pgal  in  the  transformants  with  Ubrlp^®^  or 
Ubrlp^®^  were  similar  to  (or  only  slightly  higher  than) 
the  levels  of  these  proteins  in  transformants  expressing 
wt  Ubrlp  (Figure  3A),  indicating  that  high-affinity  binding 
of  Ubrlp  to  Ubc2p  is  not  essential  for  the  activity  of  the 
N-end  rule  pathway.  Control  experiments  with  ubc2A  cells 
confirmed  that  the  Ubrlp^®^-dependent  degradation  of 
N-end  rule  substrates  was  Ubc2p-dependent  as  well  (data 
not  shown). 

Previous  work  with  the  tailless  Ubc2p  1^149  mutant. 
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Fig.  3.  High-affinity  interaction  between  Ubrlp  and  Ubc2p  is  not  essential  for  the  activity  of  the  N-end  rule  pathway.  (A)  The  activity  of  Ubrlp  is 
weakly  affected  by  MB  mutations.  All  UBRl  alleles  were  expressed  from  the  Pubri  promoter  in  a  low-copy  vector.  Constructs  expressing  wt  Ubrlp, 
UbrlpMB4  Qj.  ubrlp^^^,  or  the  empty  vector  were  transformed  into  JD55  (ubrlA)  cells  expressing  Arg-Pgal  or  Leu-pgal.  Measurements  of  Pgal 
activity  were  used  to  assay  the  degradation  of  Arg-pgal  and  Leu-pgal.  The  values  shown  are  the  means  of  duplicate  measurements  of  three 
independent  transformants.  Standard  deviations  are  indicated  above  the  bars.  (B  and  C)  High-affinity  association  of  Ubrlp  and  Ubc2p  is  not  essential 
for  activity  of  the  N-end  rule  pathway,  pgal  activity  assays  (B)  and  immunoprecipitations  with  anti-Pgal  antibody  (C)  were  used  to  compare  the 
degradation  of  model  substrates,  either  Arg-Pgal  (B  and  C)  or  Tyr-Ppl  (B).  All  UBRJ  and  UBC2  alleles  were  expressed  from  the  native  Pubri  and 
PuBC2  promoters,  respectively,  in  a  low-copy  vector.  Different  combinations  of  UBRl  and  UBC2  alleles  were  transformed  into  ubrlA  ubc2A  cells 
expressing  either  Arg-pgal,  Tyr-Pgal  (B)  or  DHFRha-Ub^^^-Arg-pgal  (C)  (see  the  text  and  Materials  and  methods).  The  measured  ratio  of  Arg-Pgal 
to  DHFR-Ub  (the  reference  protein)  in  the  absence  of  Ubrlp  [(C),  lane  1]  was  set  at  1.00  and  used  to  normalize  the  other  ratios. 


which  did  not  bind  to  wt  Ubrlp,  led  to  the  conclusion 
that  Ubc2pi_i49  did  not  support  the  degradation  of  N-end 
rule  substrates  (Madura  et  aL,  1993),  and  implied  that 
high-affinity  binding  of  Ubc2p  to  Ubrlp  was  strictly 
essential  for  the  pathway’s  function.  However,  another 
study  reported  a  diminished  but  significant  activity 
Ubc2pi_j49  (Watkins  et  aL,  1993).  The  latter  conclusion 
was  consistent  with  our  findings  about  the  BRR-region 
mutants  of  Ubrlp  (Figure  3  A).  To  address  this  discrepancy, 
we  compared  the  activity  of  the  N-end  rule  pathway  in 
ubrlA  ubc2A  cells  expressing  different  combinations  of 
either  wt  Ubrlp  or  Ubrlp^®^  with  either  wt  Ubc2p  or  the 
tailless  Ubc2p  (Ubc2pi_i49).  UBRl,  UBC2  and  their  mutant 
alleles  were  expressed  from  their  native  promoters  and  a 
low-copy  plasmid.  The  relative  rates  of  degradation  of 
Arg-Pgal  and  Tyr-Pgal  {tyi  of  -2  and  ~10  min,  respect¬ 
ively)  (Bachmair  et  al.,  1986;  Varshavsky,  1996)  were 
assayed  by  measuring  the  activity  of  Pgal  (Figure  3B). 
With  wt  Ubc2p,  the  activity  of  the  N-end  rule  pathway 
did  not  change  significantly  between  the  wt  Ubrlp  and 
UbripMB6  alleles  (Figure  3B).  Furthermore,  in  the  presence 
of  wt  Ubrlp,  the  pathway’s  activity  decreased  significantly 
but  not  strongly  upon  the  replacement  of  wt  Ubc2p  with 
its  tailless  derivative  Ubc2pi_i49  (Figure  3B),  in  agreement 
with  the  finding  of  Watkins  et  al.  (1993)  but  in  contrast 
to  the  result  of  Madura  et  al.  (1993).  Interestingly,  the 
activity  of  the  N-end  rule  pathway  was  further  decreased, 
but  still  not  abolished,  by  combining  Ubrlp^®^  and 
Ubc2pi_i49  (Figure  3B). 

To  measure  the  relative  metabolic  stability  of  Arg-pgal 
in  a  different  way,  we  utilized  a  recently  developed 
UPR  (Ub/protein/reference)  technique,  which  provides  a 
reference  protein  (Levy  et  al,  1996,  1999).  This  method 
employs  a  linear  fusion  such  as  DHFRha-Ub*^'^^-X-Pgal, 
in  which  Ub^"^^,  containing  Arg  instead  of  Lys  at  position 
48,  is  placed  between  a  protein  of  interest  (X-Pgal)  and 
a  reference  protein  moiety  such  as  DHFRha  (mouse 


dihydrofolate  reductase  bearing  the  HA  epitope  tag)  (Levy 
et  al,  1996).  DHFRha-Ub^^^-X-pgal  is  co-translationally 
cleaved  by  Ub-specific  deubiquitylating  enzymes  (DUBs) 
(Wilkinson  and  Hochstrasser,  1998)  after  the  last  residue 
of  Ub^"^^,  producing  equimolar  amounts  of  X-Pgal  and 
DHFRha-Ub^^^.  This  way,  the  relative  amounts  of  X-pgal 
can  be  normalized  against  the  (co-immunoprecipitated) 
DHFRha-Ub^"^^  reference  protein  in  the  same  sample.  The 
UPR  technique  can  therefore  compensate  for  the  scatter 
of  immunoprecipitation  yields,  sample  volumes,  and  other 
sources  of  sample-to-sample  variation  (Levy  et  al,  1996). 
ubrlA  ubc2A  cells  expressing  DHFRha-Ub^"^^- Arg- Pgal, 
and  wt  Ubrlp  or  Ubrlp^®^  with  wt  Ubc2p  or  Ubc2pi„i49 
were  labeled  with  [^^S]methionine  for  30  min  at  30°C. 
Cell  extracts  were  immunoprecipitated  with  both  anti-Pgal 
and  anti-HA  antibodies,  followed  by  SDS-PAGE  and  the 
determination,  using  a  Phosphorlmager,  of  the  ratio  of 
Arg-Pgal  (test  protein)  to  DHFRha-Ub^"^^  (reference  pro¬ 
tein)  (Figure  3C).  The  ratio  determined  for  U5R7 -lacking 
cells  was  arbitrarily  set  at  1.00,  and  was  used  to  normalize 
the  same  ratios  from  other  transformants.  The  results 
(Figure  3C)  were  in  agreement  with  those  obtained  by 
measuring  the  enzymatic  activity  of  Pgal  (Figure  3A  and 
B).  Specifically,  in  the  presence  of  wt  Ubc2  and  Ubrlp^®^, 
-90%  of  Arg-pgal  was  degraded  under  the  assay’s  condi¬ 
tions,  similar  to  the  result  with  wt  Ubc2p  and  wt  Ubrlp. 
In  cells  expressing  wt  Ubrlp  and  Ubc2pi_j49,  -70%  of 
Arg-Pgal  was  degraded.  With  Ubrlp^®^  and  Ubc2pi_i49 
the  degradation  of  Arg-Pgal  was  decreased  further,  to 
-50%,  but  still  not  abolished  (Figure  3C). 

Taken  together,  these  results  (Figure  3)  indicated  that 
high-affinity  interactions  between  the  E3  Ubrlp  and  the 
E2  Ubc2p  are  essential  for  normal  levels  of  the  pathway’s 
activity.  These  data  also  showed  that  even  much  weaker 
interactions  between  mutant  variants  of  Ubrlp  and  Ubc2p, 
the  interactions  that  could  not  be  detected  by  either  two- 
hybrid  or  pull-down  assays  (Figures  1  and  2),  were  still 
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sufficient  to  sustain  a  large  fraction  of  activity  of  the 
N-end  rule  pathway  (Figure  3).  We  conclude  that  a 
transient  interaction  between  Ubrlp  and  Ubc2p  is  function¬ 
ally  suboptimal  but  still  sufficient  for  the  multiubiquityl- 
ation  of  a  substrate  and  for  the  substrate’s  delivery  to  the 
26S  proteasome. 

The  RING-H2  finger  of  Ubrlp  is  essential  for 
degradation  of  N-end  rule  substrates 

Given  the  proximity  of  the  104  residue  RING-H2  finger 
to  the  BRR  region  of  Ubrlp  that  mediates  its  binding  to 
Ubc2p  (Figure  1),  we  determined  the  effects  of  missense 
mutations  in  RING-H2  (Figure  IB)  on  the  ability  of  Ubrlp 
to  target  N-end  rule  substrates  for  ubiquitylation  and 
degradation.  Since  the  structure  of  the  folded  RING-H2 
domain  is  unknown,  we  constructed  single  missense 
mutants  throughout  this  domain  (Figure  IB).  The 
sequences  RING-H2^R^  RING-H2^^^^^  RING-H2^^3 
RING-H2^^^  contained  single  Cys^Ser  substitutions  at 
the  Cysl,  Cys2,  Cys4  and  Cys5  positions  of  RING-H2, 
respectively  (Figure  1 A  and  B).  RING-H2^^^^  contained 
a  His^Ala  substitution  at  the  Hisl  position  of  RING-H2 
(Figure  lA  and  B).  These  RING-H2  variants  were  intro¬ 
duced  into  wt  Ubrl.  The  resulting  mutant  proteins, 
Ubrlp^^^  Ubrlp^^^,  Ubrlp^^^,  Ubrlp^^^^  and 
Ubrlp^^^^,  were  expressed  in  ubrl  A  cells  from  a  low- 
copy  plasmid,  the  native  Pubri  promoter,  and  in  the 
presence  of  either  Arg-Pgal  or  Leu-Pgal,  as  described  for 
the  BRR-domain  mutants  of  Ubrlp  (Figure  3). 

In  striking  contrast  to  the  negligible  effect  of  mutations 
in  the  RING-H2  finger  of  Ubrlp  on  the  binding  of  Ubrlp 
to  Ubc2p  (Figures  1C,  D  and  2),  all  of  the  tested 
RING-H2  mutations  except  MR12  had  a  major  inhibitory 
effect  on  the  degradation  of  N-end  rule  substrates 
(Figure  4).  Specifically,  Ubrlp^^\  which  bore  a  Cys^Ser 
substitution  at  Cysl  of  the  RING-H2  finger,  was  com¬ 
pletely  inactive  as  an  E3  (N-recognin),  in  that  the 
steady-state  levels  of  Arg-pgal  and  Leu-Pgal  in  cells 
expressing  Ubrlp^^^  were  indistinguishable  from  the 
levels  of  these  substrates  in  ubrl  A  cells,  which  lacked  the 
N-end  rule  pathway  (Figure  4).  Other  mutations  of  the 
RING-H2  finger  impaired  the  N-end  rule  pathway  strongly 
but  incompletely,  whereas  the  MR  12  mutation  (Hi s^ Ala 
at  Hisl  of  RING-H2)  had  a  relatively  weak  effect 
(Figure  4). 

Previous  work  has  shown  that  Ubrlp  is  a  rate-limiting 
component  of  the  N-end  rule  pathway,  in  that  overexpres¬ 
sion  of  Ubrlp  accelerated  the  degradation  of  N-end 
rule  substrates  (Bartel  et  aL,  1990).  We  introduced  the 
apparently  inactive  Ubrlp^^^  and  the  partially  active 
Ubrlp^^^  mutations  into  wt  Ubrlp  bearing  the  N-terminal 
FLAG  epitope  and  expressed  the  resulting  proteins  from 
the  Padhi  promoter  on  a  high-copy  plasmid.  Overexpres¬ 
sion  of  the  partially  active  Ubrlp^^^  increased  the  activity 
of  the  N-end  rule  pathway  in  comparison  with  cells 
expressing  Ubrlp^^^  at  wt  levels  (Figure  4).  In  contrast, 
the  effect  of  overexpressing  Ubrlp^^^  was  marginal,  in 
that  only  -*4%  of  Leu-Pgal  and  <20%  of  Arg-Pgal  were 
degraded,  in  comparison  with  virtually  no  degradation  at 
physiological  levels  of  Ubrlp^^^  (Figure  4).  Finally,  the 
-90  kDa  in  vivo  cleavage  product  of  pgal  (denoted  by  an 
asterisk),  which  is  characteristic  of  short-lived  X-Pgal 
substrates  and  is  not  produced  in  the  absence  of  either 
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Fig.  4.  Proteolysis  by  the  N-end  rule  pathway  requires  the  RING-H2 
finger  of  Ubrlp.  Methods  described  in  the  legend  to  Figure  3  and 
Materials  and  methods  were  used  to  assess  the  degradation  of  test 
substrates  Arg-pgal  (A  and  B)  and  Leu-pgal  (A)  in  the  presence  of 
different  UBRl  alleles  (see  Figure  IB).  All  UBRl  alleles  were 
expressed  from  the  native  Pubri  promoter  in  a  low-copy  vector.  The 
UbrlpMRi  (ADHIMRI)  and  Ubrlp^^^  (ADH1MR2)  mutants  were 
also  expressed  from  the  Padhi  promoter  in  a  high-copy  vector. 
Constructs  expressing  wt  Ubrlp  and  its  mutants  were  transformed 
into  JD55  (ubrl A)  cells  expressing  the  Arg-Pgal  or  Leu-pgal 
[(A),  pgal  activity  assays],  or  DHFRha-Ub^'^^-Arg-pgal  [(B), 
immunoprecipitation  assays].  The  activity  of  pgal  in  the  absence  of 
UBRl  (vector  alone)  was  set  at  100%.  The  values  shown  are  the 
means  of  duplicate  measurements  of  three  independent  transformants. 
Standard  deviations  are  indicated  above  the  bars.  In  (B),  the  asterisk 
denotes  the  90  kDa  Pgal  cleavage  product  characteristic  of  the  short¬ 
lived  X-Pgals  (Bachmair  et  al,  1986).  Ubrlp  co-immunoprecipitated 
with  anti-pgal  antibody  is  also  indicated.  The  measured  ratio  of 
Arg-Pgal  to  DHFR-Ub  in  the  absence  of  Ubrlp  [(B),  lane  vector]  was 
set  at  1.00  and  used  to  normalize  the  other  ratios. 


Ubrlp  or  Ubc2p  (Dohmen  et  aL,  1991),  was  nearly  absent 
with  Ubrlp^^^  (Figures  4B  and  5B);  this  result  was 
consistent  with  negligible  functional  activity  of  Ubrlp^^k 
Previous  work  has  shown  that  the  N-end  rule  pathway 
controls  the  import  of  peptides  through  degradation  of  the 
homeodomain  protein  Cup9p,  which  inhibits  the  expres¬ 
sion  of  the  Ptr2p  peptide  transporter  (Byrd  et  al,  1998). 
We  examined  whether  cells  expressing  Ubrlp^^^  instead 
of  wt  Ubrlp  were  perturbed  in  their  ability  to  import 
peptides  by  comparing  the  growth  of  lys2A  ubrl  A  cells 
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Fig.  5.  The  RING-H2  finger  of  Ubrlp  is  required  for  ubiquity lation  of 
N-end  rule  substrates.  (A)  The  RING-H2  finger  is  not  required  for  the 
binding  of  N-end  rule  substrates  by  Ubrlp.  The  UPR-based  fusion 
DHFRha-Ub^'^^-Leu-Pgal  (see  the  text  and  Materials  and  methods) 
was  co-expressed  in  AVY301  {ubrlls.  ubc2A)  cells  with  the  FLAG- 
tagged  wt  Ubrlp  (lane  3),  Ubrlp^^^  (lane  4),  Ubrlp^^^  5^  qj. 
vector  alone  (lane  2).  The  Ubrlp  proteins  were  expressed  from  the 
Padhi  promoter  in  a  high-copy  vector.  Lane  1,  AVY301  cells 
expressing  wt  Ubrlp  alone.  Extracts  from  cells  labeled  with 
[^^S] methionine  were  immunoprecipitated  with  anti-j3gal  (top  panel)  or 
anti-FLAG  antibodies  (lower  panel),  followed  by  SDS-8%  PAGE  (see 
Materials  and  methods).  (B)  The  RING-H2  finger  of  Ubrlp  is  required 
for  ubiquitylation  of  N-end  rule  substrates.  JD55  (ubrlA)  cells 
expressing  Leu-pgal  and  different  amounts  of  Ubrlp  or  its  mutants 
were  labeled  with  [^^S]methionine  for  6  min  at  30°C,  followed  by  the 
addition  of  NEM  to  50  mM  and  preparation  of  extracts  by  heating  at 
100°C  in  the  presence  of  SDS  (see  Materials  and  methods).  The 
extracts  were  precipitated  with  anti-pgal  antibody  and  fractionated  by 
SDS-6%  PAGE.  Note  the  virtual  absence  of  the  multiubiquitylated 
Leu-pgal  species  (denoted  Ub^-Pgal  on  the  right)  from  cells  expressing 
Ubrlp*^*^^  (lane  3).  The  asterisk  denotes  the  90  kDa  pgal  cleavage 
product  characteristic  of  short-lived  X-pgals  (Bachmair  et  al,  1986). 

expressing  either  wt  Ubrlp,  Ubrlp^^^  or  Ubrlp^®^  on 
plates  containing  Lys-Ala  dipeptide  as  the  sole  source  of 
lysine.  Under  these  conditions,  inactivation  of  the  N-end 
rule  pathway  is  lethal,  owing  to  the  inability  of  cells 
(auxotrophic  for  lysine)  to  import  the  Lys-Ala  dipeptide 
(Byrd  et  al,  1998).  Cells  expressing  Ubrlp^^^  which  is 
virtually  inactive  as  N-recognin,  failed  to  grow  on  Lys- 
Ala  dipeptide  plates,  whereas  cells  expressing  either  wt 
Ubrlp  or  Ubrlp^®^  survived  (data  not  shown),  consistent 
with  the  effects  of  the  MRl  and  MB  6  mutations  on  the 
activity  of  the  N-end  rule  pathway  (Figures  3  and  4). 

We  conclude  that  at  least  Cysl  of  the  RING-H2  finger 
in  Ubrlp  is  essential  for  the  E3  function  of  Ubrlp  in  the 


N-end  rule  pathway,  even  though  this  cysteine  is  not 
required  for  the  binding  of  Ubrlp  to  Ubc2p.  Yet  another 
insight  from  the  pulse-immunoprecipitation  assays  with 
Arg-|3gal  was  that  the  anti-pgal  antibody  co-precipitated 
the  partially  active  FLAG-Ubrlp^^^  and  the  inactive 
FLAG-Ubrlp^^^  from  the  extracts  of  cells  that  over¬ 
expressed  these  Ubrlp  mutants  (Figure  4B).  Thus,  the 
functionally  inactivating  Cysl^Serl  mutation  in  the 
RING-H2  finger  of  Ubrlp^^^  may  not  have  impaired 
the  binding  of  substrates  by  Ubrlp^^^  This  conjecture  is 
confirmed  below. 

RING-H2  of  Ubrlp  is  not  required  for  the  binding 
of  N-end  rule  substrates  by  Ubrlp 

To  compare  directly  the  affinities  of  Ubrlp  and  its  deriva¬ 
tives  for  an  N-end  rule  substrate,  FLAG-Ubrlp,  FLAG- 
Ubrlp^^^  or  FLAG-Ubrlp^^^  were  overexpressed  in  a 
ubrlA  ubc2A  strain,  which  also  expressed  Leu-Pgal  (pro¬ 
duced  from  Ub-Leu-Pgal).  Owing  to  the  absence  of 
Ubc2p,  Leu-Pgal  was  long-lived  in  this  strain,  irrespective 
of  the  presence  of  Ubrlp  (Dohmen  et  al,  1991).  Controls 
included  congenic  cells  expressing  FLAG-Ubrlp  alone  or 
Leu-Pgal  alone.  Cells  were  labeled  with  [^^S] methionine 
for  30  min;  proteins  were  immunoprecipitated  from 
extracts  with  anti-Pgal  antibody,  followed  by  SDS-PAGE. 
Anti-pgal  co-immunoprecipitated  Leu-pgal  and  compar¬ 
able  amounts  of  either  wt  FLAG-Ubrlp,  FLAG-Ubrlp^^^ 
or  FLAG-Ubrlp^^^  (Figure  5A,  upper  panel,  lanes  3-5). 
This  co-immunoprecipitation  was  specific  for  Leu-Pgal, 
because  no  FLAG-Ubrlp  was  brought  down  with  anti- 
Pgal  in  the  absence  of  Leu-pgal  (Figure  5A,  upper  panel, 
lane  1).  The  expression  levels  of  FLAG-Ubrlp,  FLAG- 
Ubrlp^^^  and  FLAG-Ubrlp^^^  were  nearly  identical,  as 
could  be  shown  by  immunoprecipitating  these  proteins 
with  anti-FLAG  antibody  (Figure  5 A,  lower  panel).  In 
contrast  to  the  results  with  anti-Pgal  antibody  (Figure  5A, 
upper  panel),  Leu-Pgal  was  not  co-precipitated  with  the 
FLAG- tagged  Ubrlp  proteins  by  anti-FLAG  antibody 
(Figure  5A,  lower  panel).  The  probable  explanation  is  that 
under  these  conditions,  only  free  (overproduced)  FLAG- 
Ubrlp  proteins  could  be  immunoprecipitated  by  anti- 
FLAG,  because  in  the  Leu-pgal/FLAG-Ubrlp  complex 
Leu- Pgal  is  bound  to  the  type  2  substrate-binding  site 
of  Ubrlp,  located  in  the  N-terminal  region  of  Ubrlp 
(A.Webster,  M.Ghislain  and  A. Varshavsky,  unpublished 
data).  As  a  result,  the  N-terminus  of  Ubrlp  could  be 
shielded  from  recognition  by  the  bulky  pgal  moiety  bound 
to  Ubrlp. 

Taken  together,  these  results  (Figures  1-5)  indicated 
that  the  MRl  (Cysl-^Serl)  mutation  in  the  RING-H2 
finger  of  Ubrlp  virtually  eliminated  the  E3  function  of 
Ubrlp,  but  did  not  impair  its  affinity  for  either  N-end  rule 
substrates  or  the  Ubc2p  Ub-conjugating  enzyme. 

The  RiNG-H2  finger  of  Ubrlp  is  required  for 
ubiquitylation  of  N-end  rule  substrates 

To  examine  ubiquitylation  of  an  N-end  rule  substrate  in 
cells  containing  mutant  Ubrlp  proteins,  FLAG-Ubrlp^^^ 
FLAG-Ubrlp^^^  FLAG-Ubrlp^^^i^^  FLAG-UbrlpMB6 
(Figure  IB)  and  the  wt  FLAG-Ubrlp  were  expressed  in 
a  ubrlA  UBC2  strain,  which  also  expressed  Leu-pgal 
(produced  from  Ub-Leu-Pgal).  To  minimize  the  extent  of 
the  degradation  of  Leu-pgal  during  labeling,  the  duration 
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of  the  [^^SJmethionine  pulse  was  reduced  to  6  min.  At  the 
end  of  the  pulse,  high  concentrations  of  A-ethylmaleimide 
(NEM),  the  inhibitor  of  DUBs  (Johnsson  and  Varshavsky, 
1994),  were  employed  to  inhibit  deubiquitylation  after 
cell  lysis.  ^^S-labeled,  immunoprecipitated  Leu-Pgal  was 
fractionated  by  SDS-PAGE  (Figure  5B). 

Multiubiquitylated  derivatives  of  Leu-pgal  were 
observed  in  the  presence  but  not  in  the  absence  of  wt 
Ubrlp,  and  the  extent  of  ubiquity lation  increased  upon 
overexpression  of  FLAG-Ubrlp  (Figure  5B,  lanes  1,  2 
and  9).  However,  virtually  no  ubiquitylated  Leu-pgal  was 
observed  with  Ubrlp^^\  the  degradation-inactive 
RING-H2  mutant  of  Ubrlp  (Figure  5B,  lane  3,  compare 
with  lanes  1  and  2).  Cells  overexpressing  FLAG-Ubrlp^^^ 
contained  trace  amounts  of  multiubiquitylated  Leu-Pgal, 
in  comparison  with  much  higher  levels  of  these  species 
in  cells  overexpressing  wt  FLAG-Ubrlp  (Figure  5B, 
lane  7,  compare  with  lane  9).  In  agreement  with  the  results 
of  degradation  tests  (Figure  4),  the  partially  degradation- 
defective  RING-H2  mutant  Ubrlp^^^  yielded  intermediate 
levels  of  the  multiubiquitylated  Leu-pgal,  and  ubiquityl- 
ation  was  strongly  enhanced  upon  overexpression  of  FLAG- 
UbripMR2  (pigure  5B,  lanes  4  and  8).  Ubrlp^^^^  whose 
ability  to  mediate  the  degradation  of  N-end  rule  substrate 
was  similar  to  that  of  wt  Ubrlp  (Figure  4),  yielded  the 
ubiquitylation  pattern  indistinguishable  from  that  produced 
by  wt  Ubrlp  (Figure  5B,  lane  6).  Consistent  with  the  results 
of  Pgal  activity  assays  (Figure  3),  Ubrlp^®^,  which  bore 
three  mutations  in  the  BRR  domain  and  was  severely 
impaired  in  its  binding  to  Ubc2p  (Figures  1-3),  yielded 
nearly  wt  levels  of  multiubiquitylated  Leu-pgal  (Figure  5B, 
lane  5,  compare  with  lane  2).  Taken  together,  these  results 
indicated  that  the  RING-H2  finger  of  Ubrlp  is  specifically 
required  for  the  Ubrlp,  Ubc2p-dependent  multiubiquityl- 
ation  of  N-end  rule  substrates,  and  that  Cysl  of  the  RING- 
H2  finger  is  a  functionally  critical  residue  of  this  domain. 

Dominant-negative  effect  of  the  ubiquityiation- 
inactive  RiNG-H2  mutant  of  Ubrlp 

The  earlier  gel  filtration  data  suggested  that  Ubrlp  functions 
as  a  monomer  (Bartel,  1990).  In  addition,  the  immunopre- 
cipitation  of  FLAG-Ubrlp  with  anti-FLAG  antibody  from 
extracts  of  cells  that  overexpressed  both  FLAG-Ubrlp  and 
Ubrlp-HA  did  not  co-immunoprecipitate  Ubrlp-HA,  also 
suggesting  the  monomeric  configuration  of  Ubrlp 
(A.Webster  and  A. Varshavsky,  unpublished  data).  We  asked 
whether  the  functionally  inactive  Ubrlp^^^  could  exert  a 
dominant-negative  effect  on  the  N-end  rule  pathway.  Either 
wt  Ubrlp  or  Ubrlp^^^  (in  addition  to  vector  alone)  was 
overexpressed  from  the  Padhi  promoter  on  a  high-copy 
plasmid  in  UBRl  cells  that  also  expressed  the  UPR- 
based  fusions  which  produced  Arg-Pgal  (or  Leu- Pgal)  and 
equimolar  amounts  of  DHFR-Ub^^^,  the  reference  protein. 
Cells  were  labeled  with  [^^S]methionine  for  30  min,  fol¬ 
lowed  by  immunoprecipitation  of  Leu-pgal  (Figure  6A)  or 
Arg-Pgal  (Figure  6B),  as  well  as  the  reference  protein 
DHFR-Ub*^^^,  The  ratios  of  X-pgal  to  DHFR-Ub^"^^  were 
measured,  and  thereafter  normalized  by  the  same  ratios  in 
congenic  ubrlA  cells  (Figure  6A  and  B,  lane  1).  Whereas 
overexpresion  of  wt  Ubrlp  in  UBRl  cells  accelerated  the 
degradation  of  Leu-Pgal  and  Arg-Pgal,  overexpression  of 
UbripMRi  strongly  inhibited  this  degradation  (Figure  6A 
and  B,  lanes  2  and  3,  compare  with  lanes  4).  The  dominant- 
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Fig.  6.  Dominant-negative  effect  of  Ubrlp^^\  the  RING-H2  mutant 
of  Ubrlp.  The  Padhi  promoter-based,  high-copy  vector  alone  (lane  2), 
and  its  derivatives  expressing  the  FLAG-tagged  wt  Ubrlp  (lane  3)  or 
Ubrlp^^^  (lane  4)  were  transformed  into  JD52  {UBRl)  cells 
expressing  the  UPR-based  fusion  DHFRha-Ub^'^^-Leu-pgal  (A)  or 
DHFRha-Ub^'^^-Arg-pgal  (B).  Cells  were  labeled  with 
[^^SJmethionine  for  30  min  at  30°C,  followed  by  extraction, 
immunoprecipitation  with  a  mixture  of  anti-pgal  and  anti-HA 
antibodies,  and  SDS-12%  PAGE,  and  quantitation.  Lanes  1  in  (A)  and 
(B)  show  the  results  of  the  same  tests  with  JD55  {uhrlA)  cells  in  the 
absence  of  any  version  of  Ubrlp.  The  measured  ratio  of  the  test 
protein  (Leu-pgal  or  Arg-pgal)  to  the  reference  protein  (DHFRha- 
Ub*^^^)  in  lanes  1  was  set  at  LOO.  The  ratios  determined  in  the  other 
settings  and  normalized  against  the  lane-1  ratios  are  shown  below  the 
lanes. 


negative  effect  of  overexpressed  Ubrlp^^^  was  caused  at 
least  in  part  by  the  sequestration  of  endogenous  Ubc2p, 
inasmuch  as  this  effect  could  be  partially  reversed  through 
overexpression  of  Ubc2p  (data  not  shown).  The  sequestra¬ 
tion  of  N-end  rule  substrates  by  Ubrlp^^^  which  could 
bind  to  substrates  indistinguishably  from  wt  Ubrlp 
(Figure  5),  was  another  likely  (and  independent)  cause  of 
the  dominant-negative  effect  of  Ubrlp^^^  (Figure  6).  The 
possibility  that  overexpression  of  Ubrlp^^^  might  also 
interfere  with  the  downstream  components  of  the  Ub  system, 
such  as  the  26S  proteasome,  was  ruled  out  by  the  observation 
that  the  degradation  of  Ub-Pro-Pgal  was  not  affected  by 
the  overexpression  of  Ubrlp^^^  (data  not  shown).  Ub- 
Pro-pgal  is  a  short-lived  protein  degraded  by  the  UFD 
(Ub/fusion/degradation)  pathway,  which  targets  the  non¬ 
removable  (or  slowly  removable)  N-terminal  Ub  moiety  of 
a  fusion  protein  (Johnson  et  al,  1995;  Koegl  et  al,  1999). 

Discussion 

We  report  two  main  results.  First,  we  identified  and  dissected 
the  E2  enzyme-binding  site  of  Ubrlp,  the  E3  component 
of  Ub-protein  ligase  in  the  N-end  rule  pathway  of  S.cerevis- 
iae.  This  site,  the  BRR  domain,  is  shown  to  bind  the  Ubc2p 
Ub-conjugating  (E2)  enzyme  mainly  through  electrostatic 
interactions  between  the  basic  residues  of  BRR  and  the 
polyacidic  C-terminal  tail  of  Ubc2p.  Although  high-affinity 
interactions  between  the  E3  Ubrlp  and  the  E2  Ubc2p  are 
essential  for  normal  activity  of  the  N-end  rule  pathway, 
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even  much  weaker  interactions  between  mutant  variants  of 
Ubrlp  and  Ubc2p,  the  interactions  which  could  not  be 
detected  by  either  two-hybrid  or  pull-down  assays,  were 
found  to  be  sufficient  for  sustaining  a  large  fraction  of  the 
pathway’s  activity.  Secondly,  we  discovered  the  requirement 
for  the  RING-H2  finger  of  Ubrlp  in  the  ubiquity lation  of 
substrates  mediated  by  the  Ubrlp-Ubc2p  complex.  A  single 
Cys-~>Ser  missense  mutation  in  the  RING-H2  finger  (which 
is  adjacent  to  the  BRR  domain  in  Ubrlp)  severely  impairs 
ubiquitylation  and  degradation  of  N-end  rule  substrates,  but 
does  not  impair  the  ability  of  Ubrlp  to  bind  the  substrates 
and  Ubc2p.  These  findings  defined  the  physical  and  func¬ 
tional  aspects  of  the  Ubc2p-Ubrlp  interaction,  and  revealed 
a  specific  essential  function  of  the  RING-H2  finger.  This 
domain  is  present  not  only  in  the  members  of  the  UBR 
family  of  E3s  (Kwon  et  ai,  1998)  but  also  in  otherwise 
dissimilar  E3  proteins  of  the  Ub  system. 

Physical  and  functional  aspects  of  E2-E3  interaction 

Ubrlp,  the  225  kDa  E3  protein,  was  shown  to  interact  with 
the  20  kDa  Ubc2p  E2  enzyme  mainly,  if  not  exclusively, 
through  the  BRR  domain  of  <15  kDa.  This  interaction 
requires  the  polyacidic  C-terminal  tail  of  Ubc2p  and  has  a 
strong  electrostatic  component,  because  conversions  of 
single  basic  residues  in  the  BRR  domain  into  neutral  ones 
resulted  in  comparable  decreases  in  the  affinity  of  Ubrlp 
for  Ubc2p.  Moreover,  this  affinity  was  decreased  in  rough 
proportion  to  the  number  of  mutated  basic  residues  in  BRR 
(Figures  1  and  2).  These  results  are  consistent  with  the 
reported  dependence  of  the  Ubrlp-Ubc2p  interaction  on 
the  length  of  the  polyacidic  tail  of  Ubc2p.  Specifically,  the 
truncated  Ubc2p  derivative  that  lacked  eight  of  23  residues 
of  the  polyacidic  tail  retained  high  affinity  for  Ubrlp,  but 
further  shortening  of  the  Ubc2p  polyacidic  tail  strongly 
decreased  the  affinity  (Madura  et  aL,  1993). 

Previous  work  suggested  that  the  9  residue  N-terminal 
region  of  Ubc2p  was  essential  for  the  binding  of  Ubc2p  to 
Ubrlp  (Watkins  et  al,  1993).  Our  current  data  demonstrated 
that  deletion  of  the  first  9  residues  of  Ubc2p  weakened  but 
did  not  eliminate  its  interaction,  in  the  two-hybrid  assay, 
with  the  287  residue  fragment  of  Ubrlp  that  contained  the 
BRR  domain  and  the  RING-H2  finger  (Figure  1).  It  is  not 
clear  how  the  N-terminal  9  residues  of  Ubc2p  contribute 
to  the  interaction  with  Ubrlp.  Inasmuch  as  Arg6  and  Arg8 
of  the  folded  Ubc2p  are  likely  to  interact  through  a  hydrogen 
bond  (Worthylake  et  a/.,  1998),  it  is  possible  that  the  effect 
of  the  9  residue  N-terminal  deletion  is  indirect,  stemming 
from  a  conformational  destabilization  of  Ubc2p.  Our  finding 
of  the  287  residue  Ubrlp  fragment  that  contains  the  BRR 
and  RING-H2  domains  and  is  sufficient  for  the  high-affinity 
binding  to  Ubc2p  (Figures  1  and  2)  should  facilitate  the 
dissection  of  this  E2-E3  interaction  at  the  atomic  level 
through  co-crystallization  of  the  previously  solved  Ubc2p 
(Worthylake  et  aL,  1998)  and  the  287  residue  fragment 
of  Ubrlp. 

Function  of  the  RING-H2  finger  of  Ubrlp 

One  of  the  major  results  of  this  work  is  the  discovery  of 
an  essential  and  specific  requirement  for  the  RING-H2 
finger  of  Ubrlp  in  the  ubiquitylation  of  N-end  rule  substrates 
mediated  by  the  Ubrlp-Ubc2p  complex.  The  RING-H2 
motif  is  a  distinct  variant  of  the  previously  defined  Cys/ 
His-rich  RING  motif.  In  several  proteins  outside  the  Ub 


system,  RING-H2  has  been  shown  to  function  as  a  site  of 
protein-protein  interactions  (Borden  and  Freemont,  1996; 
Inouye  et  al.,  1997;  Kamura  et  al.,  1999).  In  addition  to 
the  UBR  family  of  E3  proteins  (Kwon  et  ai,  1998),  several 
other  E3  components  of  the  Ub  system,  including  Ape  lip. 
Roc  Ip  (Rbxlp  or  Hrtlp),  Roc2p  and  their  homologs  in 
multicellular  eukaryotes,  have  also  been  found  to  contain 
RING-H2  or  other  variants  of  RING  (Yu  et  al,  1998; 
Zachariae  et  al,  1998;  Ohta  et  aL,  1999;  Seol  et  aL,  1999; 
Skowyra  et  aL,  1999;  Tan  et  aL,  1999). 

The  RING-H2  of  Ubrlp  is  adjacent  to  the  BRR  domain, 
which  mediates  the  physical  interaction  between  Ubrlp  and 
Ubc2p  (Figures  1  and  2).  A  single  Cys^Ser  mutation 
at  position  1  of  the  RING-H2  finger  abolished  ubiquityl¬ 
ation  and  degradation  of  N-end  rule  substrates,  but  did 
not  impair  the  ability  of  the  resulting  mutant,  Ubrlp^^^ 
to  bind  both  the  substrates  and  Ubc2p  (Figures  4-6).  The 
Cys-^Ser  mutations  at  other  positions  of  the  RING-H2 
finger  had  similar  but  partial  effects,  while  the  His-»Ala 
mutation  at  Hisl  position  of  RING-H2  had  the  smallest 
effect  among  the  mutants  tested  (Figures  4-6).  Thus,  even 
though  the  Ubc2p  E2  enzyme  can  interact  with  the  El 
enzyme  to  yield  the  Ubc2p~Ub  thioester,  and  can  also 
produce  isopeptide  bond-mediated  Ub-protein  conjugates 
in  the  absence  of  E3  (Jentsch  et  aL,  1987),  the  synthesis 
of  a  multi-Ub  chain  by  the  Ubc2p-Ubrlp  complex  requires 
the  presence  of  cysteine  at  position  1  of  the  RING-H2 
finger  in  Ubrlp. 

The  Ub  system  of  a  cell  contains  a  number  of  distinct 
E2  and  E3  proteins  (Hochstrasser,  1996;  Varshavsky,  1997; 
Hershko  and  Ciechanover,  1998).  An  E3  protein  of  a  Ub- 
dependent  proteolytic  pathway  was  initially  presumed  to 
function  as  a  non-catalytic,  substrate-recognition  compon¬ 
ent  of  the  corresponding  Ub-protein  ligase,  which  consists 
of  an  E3  and  its  associated  E2  enzyme.  Ubrlp  (N-recognin) 
of  the  present  work  was  the  first  E3  that  has  been 
shown  to  bind  proteins  bearing  specific  degradation  signals 
(Bachmair  et  aL,  1986;  Bartel  et  aL,  1990).  More  recently, 
it  was  found  that  in  addition  to  recognizing  specific 
substrates,  at  least  some  E3s,  specifically  those  that  contain 
the  HECT  domain  (Scheffner  et  aL,  1993),  also  function 
as  enzymes.  An  E3  of  this  class  can  accept  the  Ub  moiety 
from  an  associated  E2~Ub  thioester,  initially  forming  the 
E3~Ub  thioester  confined  to  a  specific  (and  essential) 
cysteine  of  E3.  The  Ub  moiety  of  this  E3-Ub  thioester  is 
thereafter  conjugated,  through  the  isopeptide  bond,  either 
directly  to  a  substrate  protein  or  to  the  Ub  moiety  of  a 
growing,  substrate-linked  multi-Ub  chain  (Scheffner  etaL, 
1995).  Other  classes  of  E3  proteins,  including  the  SCF, 
APC  and  UBR  families  (Hershko  and  Ciechanover,  1998; 
Kwon  et  aL,  1998;  Tyers  and  Willems,  1999),  lack 
recognizable  HECT  domains,  and  have  not  thus  far  been 
demonstrated  to  form  thioesters  with  Ub. 

The  mutations  tested  in  the  RING-H2  finger  of  Ubrlp 
(including  the  Cysl  mutation  which  abolished  the  ubi¬ 
quitylation  activity  of  the  Ubrlp-Ubc2p  complex)  did  not 
impair  the  ability  of  Ubrlp  to  bind  the  Ubc2p  enzyme 
and  N-end  rule  substrates  (Figures  1-5).  Thus,  the  specific 
functional  effect  of  these  mutations  cannot  be  due  to 
a  global  conformational  destabilization  of  Ubrlp.  We 
conclude  that  the  Ubrlp  (E3)  component  of  the  N-end 
rule  pathway  not  only  recognizes  specific  degradation 
signals  in  substrate  proteins,  but  in  addition  directly 


6840 


RING-H2  finger  and  E3  function 


participates,  through  a  mechanism  that  requires  the 
RING-H2  domain,  in  the  Ubc2p-mediated  synthesis  of  a 
substrate-linked  multi-Ub  chain.  In  the  first  of  possible 
models,  a  specific  cysteine  of  RING-H2,  perhaps  the 
essential  Cysl  residue  (Figure  1),  acts  as  an  acceptor  of 
the  Ub  moiety  transesterified  from  the  Ubc2p~Ub  thioester, 
similar  to  the  one  previously  described  mechanism  of  the 
HECT-domain  E3  called  E6AP  (Scheffner  et  ai,  1995; 
Huibregtse  et  al,  1998).  Earlier  experiments  with  an 
in  vitro  ubiquity lation  system  containing  purified  Sxerevis- 
iae  Ubrlp  and  other  targeting  components  of  the  N-end 
rule  pathway  suggested  the  Ubc2p-dependent  formation 
of  a  Ubrlp'-Ub  thioester  (V.Chau  and  A. Varshavsky, 
unpublished  data).  Attempts  to  rigorously  verify  the  func¬ 
tionally  relevant  presence  of  such  a  thioester  in  the 
completely  defined,  Ubrlp-based  in  vitro  system  have 
not  been  successful  thus  far  (F.Du  and  A. Varshavsky, 
unpublished  data).  Another  possible  role  of  the  cysteine(s) 
of  the  RING-H2  finger  in  Ubrlp  may  be  to  modify  the 
immediate  vicinity  of  the  active  cysteine  (Cys88)  of  the 
Ubrlp-associated  Ubc2p  in  a  way  that  facilitates  the  multi- 
Ub  synthesis  by  Ubc2p,  in  the  absence  of  Ubrlp-Ub 
thioester  formation.  Physical  proximity  between  the 
Ubc2p-binding  BRR  domain  of  Ubrlp  and  its  RING-H2 
domain  (Figure  lA)  is  consistent  with  this  mechanism.  A 
third  model  posits  the  existence  of  an  unidentified  factor 
that  binds  to  the  RING-H2  finger  of  Ubrlp,  and  is  required 
for  the  efficient  synthesis  of  multi-Ub  by  the  Ubrlp- 
Ubc2p  complex.  Jentsch  and  colleagues  (Koegl  et  aL, 
1999)  have  identified  a  factor,  termed  E4,  which  facilitates 
the  synthesis  of  a  multi-Ub  chain  linked  to  a  substrate  of 
the  S.cerevisiae  UFD  pathway.  This  factor,  encoded  by 
the  UFD2  gene,  is  required  for  the  in  vivo  degradation  of 
Ub^^^-Val-Pgal,  but  is  not  required  for  the  degradation  of 
a  much  smaller  and  monomeric  UFD  substrate  such  as 
UI^v76_y^I_j)jIPr.  Ufd2p  is  also  not  required  for  the 
degradation  of  pgal-based  N-end  rule  substrates  (Johnson 
et  a/.,  1995).  Whether  the  function  of  the  RING-H2  finger 
of  Ubrlp  is  mediated  by  a  finger-binding  protein  analogous 
to  E4  remains  to  be  determined. 

The  RING-H2  finger  and  its  variants  are  present  in 
several  otherwise  dissimilar  classes  of  E3  proteins  in  the 
Ub  system.  For  example,  the  mammalian  E3  encoded  by 
the  Mdm2  gene  and  specific  for  the  p53  substrate  contains 
a  putative  RING  finger  (Elenbaas  et  al.,  1996);  the 
conserved  Cys464  residue  of  this  domain  was  proposed 
to  define  an  active  site  analogous  to  the  Ub  thioester- 
forming  site  of  the  HECT-domain  E3  enzymes  (Honda 
et  al,  1997).  Roclp  (Rbxlp,  Hrtlp),  a  recently  identified 
subunit  of  the  cell  cycle-regulating  Ub-protein  ligase 
complex  called  SCF,  contains  a  variant  of  RING-H2  in 
which  Cys6  is  replaced  by  Asp  (Kamura  et  al,  1999). 
Mutagenesis-based  analyses  suggested  that  the  RING 
finger  of  Roclp  was  required  for  the  function  of  the  SCF 
complex  (Kamura  et  aL,  1999;  Ohta  et  aL,  1999).  The 
sequence  of  Roclp  (Rbxlp,  Hrtlp)  is  highly  similar  to 
that  of  Apcllp,  the  RING-H2-containing  subunit  of 
another  cell  cycle-regulating  Ub-protein  ligase,  called 
APC  or  cyclosome  (Yu  et  aL,  1998;  Zachariae  et  al., 
1998).  The  RING-H2  finger  and  its  variants  have  also 
been  found  in  other  (possibly  also  E3)  components  of  the 
Ub  system,  including  Siah-1  and  Der3p  (Hrdlp),  which 
are  required,  respectively,  for  the  degradation  of  the  DCC 


protein  (deleted  in  colorectal  cancer)  (Hu  and  Fearon, 
1999)  and  of  the  misfolded  lumenal  and  integral  membrane 
proteins  (Bordallo  et  aL,  1998). 

Further  dissection  of  the  RING  domains  in  these  and 
other  E3  proteins  will  determine  whether  the  ubi¬ 
quity  lation-enabling  function  of  RING-H2  in  Ubrlp  of 
the  N-end  rule  pathway  is  a  general  property  of  the  E3 
components  in  the  Ub  system. 

Materials  and  methods 

Strains,  media  and  fi-galactosidase  assay 

The  S.cerevisiae  strains  used  were  JD55  (MATa  trp}-A63  ura3-52  his3- 
A200  leu2-3,  112  lys2-801  ubrlA::HlS3)\  JD52  (MATa  trpl-A63  ura3- 
52  his3-A200  leu2-3,  112  lys2-801)\  AVY301  (a  ubc2A::URA3  derivative 
of  JD55);  and  PJ69-4A  (a  gift  from  P. James  and  E.Craig)  (MATa  trpl-109 
leu2-3,  112  ura3-52  his3-A200  gal4A  galSOA  GAL2-ADE2  LYS2::GAL1- 
H1S3  met::GAL7-lacZ).  Cultures  were  grown  in  rich  (YPD)  or  in 
synthetic  media  containing  standard  ingredients  (Sherman,  1991)  and 
either  2%  glucose  (SD  medium),  2%  galactose  (SG  medium)  or  2% 
raffinose.  Yeast  were  transformed  as  described  previously  (Chen  et  aL, 
1992).  For  the  induction  of  the  Pcupi  promoter,  CUSO4  was  added  to  a 
final  concentration  of  0.1  mM  (Ghislain  et  al.,  1996).  The  enzymatic 
activity  of  pgal  in  yeast  extracts,  from  cultures  at  OD^qo  of  0.8-1. 0,  was 
determined  as  described  previously  (Baker  and  Varshavsky,  1991),  using 
o-nitrophenyl-p-D-galactopyranoside  (ONPG).  The  Escherichia  coli 
strain  used  was  DH5a. 

Uses  of  the  two-hybrid  system 

The  UBC2  ORF  was  amplified  by  PCR  using  pKM1313  (Madura  and 
Varshavsky,  1994)  as  the  template  DNA.  The  two  primers  were  designed 
so  that  the  PCR  products  contained  a  Ncol  site  immediately  5'  to  the 
start  codon,  and  a  Sail  site  immediately  3'  to  the  stop  codon.  The  Ncol- 
Sall-cut  PCR  fragment  was  subcloned  into  Ncol-Sall-cvX  PAS 2  (Durfee 
et  al.,  1993),  yielding  pAS2-UBC2,  which  expressed  a  fusion  containing 
the  DNA-binding  domain  of  Gal4p  upstream  of  the  Ubc2p  moiety. 
The  construct  was  verified  by  nucleotide  sequencing,  and  also  by 
immunoblotting,  in  which  the  Gal4-Ubc2p  protein  was  detected  with  an 
anti-Gal4  antibody  (Santa  Cruz  Biotechnology,  Santa  Cruz,  CA).  Gal4- 
Ubc2p  retained  the  ability  of  Ubc2p  to  rescue  the  UV-sensitivity  defect 
of  ubc2A  cells.  The  PCR  primers  used  were  ggaaccatggagatgtccacacca- 
gctagaag  and  ccttgtcgactcagtctgcttcgtcgtcgt.  The  two-hybrid  DNA  librar¬ 
ies  and  the  host  S.cerevisiae  strain  PJ69-4A  were  gifts  from  P. James  and 
E.Craig  (James  et  al.,  1996).  PJ69-4A  expressed  three  reporters 
from  different  inducible  promoters:  Pgal2-^^^7,  and 

VQ^u-lacZ.  pAS2-UBC2  was  transformed  into  PJ69-4A.  Trp'*'  trans¬ 
formants  were  then  transformed  with  the  two-hybrid  libraries,  with 
selection  directly  on  SD  (-trp,  -leu,  -ade)  plates  at  30°C.  Ade'*’  colonies 
were  then  tested  on  SD  (-trp,  -leu,  -his)  plates.  Library  plasmids  were 
rescued  from  the  Ade"^  His'*'  transformants,  and  re-transformed  into 
PJ69-4A  expressing  either  pAS2-UBC2  or  the  control  vector  pAS2.  The 
plasmids  that  induced  ADE2  and  H1S3  in  the  presence  but  not  in  the 
absence  of  pAS2-UBC2-encoded  putative  Ubc2p-interacting  proteins, 
and  were  analyzed  by  partial  sequencing.  One  clone  thus  identified 
encoded  a  287  residue  fragment  (position  1134-1420)  of  the  ORF 
YLR024C,  which  was  previously  termed  UBR2  (Hochstrasser,  1996; 
Kwon  et  al,  1998),  as  described  in  Results. 

The  two-hybrid  system  was  also  used  to  qualitatively  assess  the 
interaction  of  either  Ubc2p  or  its  truncated  derivatives  with  the  BRR 
domain  and/or  the  RING-H2  finger  of  either  wt  Ubrlp  or  its  derivatives 
containing  point  mutations.  UBC2j_j4g,  encoding  the  C-terminally  trun¬ 
cated  Ubc2p  lacking  the  23  residue  polyacidic  tail,  was  constructed 
using  PCR  and  the  primers  ggaaccatggagatgtccacaccagctagaag  and 
gctagtcgacccaagatttctctaccgtctc.  The  PCR  products  were  cut  by  Sail  and 
BamHl,  and  subcloned  into  Sall-BamHl-cut  pGBDUCl  (James  et  al, 
1996),  yielding  pGBDUCl UBC2i_i49,  which  expressed  a  fusion  of 
Ubc2p]_i49  to  the  DNA  binding  domain  of  Gal4p.  Similarly, 
UBC2]o-i72y  encoding  Ubc2p  lacking  the  first  9  residues,  was  constructed 
using  the  primers  tacgggatccatgagagattttaaacgtatg  and  ccttgtcgactcagtctg¬ 
cttcgtcgtcgt.  The  PCR  products  were  digested  with  Sail  and  BamHl,  and 
subcloned  into  Sall-BamBl-cut  pGBDUCl  vector,  yielding  pGBDU¬ 
Cl  UBC2io-i72-  The  UBRl  fragment  encoding  residues  1081-1367, 
including  the  BRR  domain  and  the  RING-H2  finger,  was  produced 
using  PCR  and  the  primers  aactggatccggtaaggactctataccagaagct  and 
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ttccgtcgacctcagtaaaaggtttgaagagcct.  The  PCR  products  were  cut  with 
BamUl  and  Sail,  and  subcloned  into  BamWl-Sall-cul  pGADCl  (James 
et  al,  1996),  yielding  pGADClUBRljo8i-i367»  which  encoded  a  fusion 
of  Ubrlpio8i-i367  to  the  transcriptional  activation  domain  of  Gal4p. 
UBR] jo8}^]220^  encoding  the  Ubrlp  fragment  containing  the  BRR  domain 
but  not  the  RING-H2  finger,  was  constructed  using  the  primers 
tacgggatccatgagagattttaaacgtatg  and  ttccgtcgacgcaggtaaaatcctccgattc,  the 
above  procedure,  and  yielding  pGADClUBRlio8i_i220’  which  expressed 
a  fusion  of  Ubrlpio8]_]220  to  the  transcriptional  activation  domain  of 
GaI4.  UBR] encoding  the  Ubrlp  fragment  containing  the 
RING-H2  finger  but  not  the  BRR  domain,  was  constructed  using  the 
primers  aactggatccgcaaggcttctagccaag  and  ttccgtcgacgcaggtaaaatcctcc- 
gattc,  the  above  procedure,  and  yielding  pGADClUBRli,77_i367,  which 
expressed  a  fusion  of  Ubrlpii77_i357  to  the  transcriptional  activation 
domain  of  Gal4.  UBRl  alleles  containing  point  mutations  in  either  the 
BRR  domain  or  the  RING-H2  finger  were  produced  using  the 
QuickChange  mutagenesis  kit  (Stratagene,  La  Jolla,  CA).  Briefly,  an 
-850  bp  EcoRl-Bgin  fragment  from  pGADCIUBRl,08i_i367  that 
encoded  the  BRR  domain  and  the  RING-H2  finger  was  subcloned  into 
the  £'coRI— J5g/II-cut  LITMUS28  vector  (New  England  Biolabs,  Beverly, 
MA),  yielding  LITMUS28UBRl(£^cc>RI-5g/II),  which  was  subjected  to 
site-directed  mutagenesis.  The  resulting  plasmids  were  cut  with  fcoRI 
and  Bglll,  and  the  -850  bp  EcoRl-Bglll  fragments  containing  the 
intended  point  mutations  were  subcloned  back  into  EcoRI-fig/II-cut 
pGADClUBRlio8i_i367.yieldingpGADClUBRlMB10,pGADClUBR- 
IMBll,  pGADClUBRlMBl?,  pGADClUBRlMB4,  pGADClUBRl- 
MB5,  and  pGADClUBRlMB6;  pGADClUBRlMRl,  pGADClUB- 
R1MR2,  pGADClUBRlMR3,  pGADClUBRlMR12,  pGADClUBRl- 
MR18,  pGADClUBRlMR?  and  pGADCl  UBRl  MRS.  All  PCR  products 
and  inserts  containing  point  mutations  were  verified  by  DNA  sequencing 
on  both  strands.  These  mutants  belonged  to  either  the  MB  or  the  MR 
groups  (Figure  IB). 

UPR  plasmids  encoding  N-end  rule  substrates 

UPR  plasmids  were  the  yeast  vector-based  counterparts  of  the  plasmids 
originally  constructed  for  expression  in  mammalian  cells  (Levy  et  al, 
1996).  These  plasmids,  termed  pRPgalUPR  and  pLpgalUPR,  expressed, 
respectively,  DHFR-Ub-Arg-pgal  and  DHFR-Ub-Leu-Pgal  from  the 
Pcupi  promoter.  A  BamWl-SmaX  insert  of  pBARUPR  or  pBALUPR  was 
replaced  with  a  3.8  kb  pgal-encoding  insert  from  pUB23-M  (Bachmair 
et  al,  1986),  using  BamWl  and  Seal.  pBARUPR  and  pBALUPR  were 
derived  from  the  low-copy,  pRS314-based  pBAM  vector  (Ghislain  etaL, 
1996).  They  expressed,  respectively,  DHFR-Ub-Arg-Ura3p  and  DHFR- 
Ub-Leu-Ura3p  from  the  PcuPi  promoter  (A. Webster  and  A.Varshavsky, 
unpublished  data),  and  were  a  gift  from  A. Webster. 

Plasmids  encoding  Ubrlp^  Ubc2p  and  their  mutant 
derivatives 

UBRl  and  its  mutant  alleles  were  expressed  either  from  its  natural 
promoter  in  a  low-copy,  pRS315-derived  vector  or  from  the  Padhi 
promoter  in  a  high-copy,  YEplacl81-based  plasmid.  Briefly,  pRS315 
(Sikorski  and  Hieter,  1989)  was  cut  with  Xbal,  blunted  by  Klenow  Pol  I, 
and  thereafter  self-ligated,  yielding  pRS3\5i-Xbal)  lacking  the  Xbal 
site.  pUBRl  (Bartel  etal.,  1990)  was  cut  with  SaB  and  An  -8.1  kb 
insert  containing  the  ORF  and  promoter  of  UBRl  was  subcloned 
into  5^7/I-P5rI-cut  pRS315(-YM),  yielding  pRS315(-XM)UBRl.  To 
produce  mutant  ubrl  alleles,  a  ~2.9  kb  Bglll-Xbal  fragment  of  UBRl 
from  pRS315(-X6flI)UBRl  was  subcloned  into  LITMUS28.  The 
resulting  LITMUS28UBRl(fig/II-X/7fi[I)  was  digested  with  Hpal  and 
Sapl,  and  an  -1.4  kb  fragment  was  isolated.  LITMUS28UBRl(BgrtI- 
Xbal)  was  also  cut  by  Sapl,  and  an  -3.7  kb  band  was  isolated.  The  1.4 
and  3.7  kb  fragments  were  ligated  with  a  set  of  -600  bp  Sapl-Hpal 
fragments  from  the  point  mutation-containing  pGADCl UBR1MB4, 
pGADClUBRlMB6,  pGADClUBRlMRl,  pGADCl  UBR1MR2, 
pGADClUBRlMR3,  pGADClUBRlMR12  or  pGADClUBRlMRl 8, 
yielding,  respectively,  pLITSUM28UBRlMB4,  pLITMUS28UBRl- 
MB6,  pLITMUS28UBRlMRl,  pLITMUS28UBRlMR2,  pLITMUS28- 
UBR1MR3,  pLITMUS28UBRlMR12  and  pLITMUS28UBRlMR18.  A 
2.9  kb  Bglll-Xbal  fragment  from  each  of  these  plasmids  was  subcloned 
into  BgUl—Xbal-QMl  pRS315(— X^£2l)UBRl,  yielding,  respectively, 
pRS315(-Y/7«I)UBRlMB4,  pRS315(-YM)UBRlMB6,  pRS315 
(-XM)UBRIMRI,  pRS315(-X^aI)UBRlMR2,  pRS315(-X7»aI)UBRl- 
MR3,  pRS315(-Yi7aI)UBRlMR12  and  pRS315(~YM)UBRlMR18. 
pNTFlagUBRl  expressed  the  full-length  Ubrlp  bearing  the  N-terminal 
FLAG  epitope  (MDYKDDDDK)  from  the  Padhi  promoter  in  a  high- 
copy,  LEU2-containing  vector.  The  MB6,  MRl  or  MR2  mutations 
(Figure  IB)  were  introduced  into  pNTFlagUBRl  by  replacing  its  2.9  kb 


Bglll-Xbal  insert  with  its  counterparts  from  pRS315(-X/7^7l)UBRlMB6, 
pRS315(-X/7flI)UBRlMRl  or  pRS315(-XM)UBRlMR2,  yielding, 
respectively,  pNTFlagUBRl  MB  6,  pNTFlagUBRl  MRl  and  pNTFlag- 
UBR1MR2. 

UBC2  and  UBC2j^j4g  alleles  containing  the  native  Pubc2  promoter 
(-500  bp  of  the  5'  region)  were  amplified  by  PCR.  The  5acI-5fl/I-cut 
PCR  products  were  subcloned  into  Sacl-Sall-cut  p413GALl  (Mumberg 
et  al.,  1994).  Both  wt  Ubc2p  and  Ubc2pi_]49  expressed  from  these 
vectors  were  able  to  complement  the  UV-sensitivity  phenotype  of 
ubc2A  cells  (data  not  shown).  The  PCR  primers  for  the  wt  UBC2  and 
the  ubc2j_j4g  were,  respectively,  acgtgagctcccctaattgaattgttaagcg  and 
ccttgtcgactcagtctgcttcgtcgtcgt,  and  acgtgagctcccctaattgaattgttaagcg  and 
gctagtcgactcaccaagatttctctaccgtctc. 

GST  binding  assays 

The  purified  GST-Ubc2p  fusion  protein  (-1  ^g)  was  diluted  into  0.5  ml 
of  the  loading  buffer  (1%  Triton  X-100,  10%  glycerol,  0.5  M  NaCl, 
1  mM  EDTA,  50  mM  Tris-HCl  pH  8.0),  incubated  with  10  jLil  (bed 
volume)  of  the  glutathione-agarose  beads  (Sigma,  St  Louis,  MO)  for 
1  h  at  4°C,  and  washed  three  times  with  1  ml  of  the  binding  buffer 
(10%  glycerol,  50  mM  NaCl,  50  mM  Na-HEPES  pH  7.5).  Washed 
beads  were  incubated  with  extracts  from  ^^S-labeled  cells 
(-5  X  10^  CCl3COOH-insoluble  counts/min/extract)  containing  the  vec¬ 
tor  alone  or  a  plasmid  expressing  one  of  the  UBRl  alleles,  including  wt 
FLAG-Ubrlp  (specifically,  pNTFlagUBRl,  pNTFlagUBRl  MB  6, 
pNTFlagUBRl  MRl  or  pNTFlagUBRl  MR2)  at  4°C  for  1  h.  The  incuba¬ 
tion  buffer  contained  1  mg/ml  of  the  ovalbumin  carrier  (Sigma,  St  Louis, 
MO)  in  the  binding  buffer.  Beads  were  washed  three  times  with  the 
binding  buffer.  Bound  proteins  were  eluted  with  the  loading  buffer 
containing  2%  SDS  and  fractionated  by  SDS-8%  PAGE. 

Co-immunoprecipitation-immunoblotting 

Ubc2p  bearing  the  N-terminal  HA  epitope  was  expressed  from  the  Pqali 
promoter  in  a  low -copy  number  vector  p416GALl  (Mumberg  et  al., 
1994).  AVY301  cells  {ubrl A  ubc2A)  containing  a  plasmid  expressing 
HA-tagged  Ubc2p  or  a  plasmid  expressing  FLAG-Ubrlp,  or  plasmids 
expressing  HA-tagged  Ubc2p  and  one  of  the  UBRl  alleles,  including 
pNTFlagUBRl,  pNTFlagUBRlMB6,  pNTFlagUBRl  MRl  or  pNTFlag- 
UBR1MR2  were  cultured  in  raffinose  medium  to  an  OD^oo  of  0.8-1. 0, 
then  switched  to  galactose  medium  for  induction  of  Ubc2p  expression 
for  6  h.  Cells  were  harvested  and  resuspended  in  DB  buffer  (50  mM 
NaCl,  1  mM  EDTA,  50  mM  Na-HEPES  pH  7.5)  containing  1 X  protease 
inhibitor  mix  (Boehringer  Mannheim),  and  thereafter  lysed  by  vortexing 
with  glass  beads  (Bartel  et  al.,  1990).  For  the  co-immunoprecipitation 
of  Ubc2p  with  Ubrlp  (Figure  2B,  top  panel),  supernatants  were  incubated 
with  anti-HA  antiserum  (Babco,  Berkeley,  CA),  followed  by  SDS-8% 
PAGE  and  immunoblotting  with  a  monoclonal  anti-FLAG  antibody 
(Sigma,  St  Louis,  MO).  The  expression  levels  of  Ubrlp  and  Ubc2p  in 
these  transformants  were  monitored  by  immunoblotting  with  anti-FLAG 
antibody  (Figure  2B,  middle  panel)  and  anti-HA  antibody  (Figure  2B, 
bottom  panel),  respectively.  SDS— 8%  PAGE  was  used. 

Metabolic  labeling  and  immunoprecipitation 

Saccharomyces  cerevisiae  cells  from  10  ml  cultures  (OD6oo  0-8-L2)  in 
SD  media  containing  0.1  mM  CUSO4  were  labeled  for  30  min  at  30°C 
with  0.15  mCi  of  P^SJmethionine/cysteine  (EXPRESS,  New  England 
Nuclear,  Boston,  MA).  The  same  conditions  were  used  to  label  FLAG- 
Ubrlp  and  its  derivatives  for  the  GST  pull-down  assays,  except  that  no 
CUSO4  was  added.  To  assay  the  in  vivo  ubiquitylation  of  N-end  rule 
substrates,  cells  were  labeled  for  6  min,  followed  by  the  addition  of  2  M 
NEM  in  dimethylformamide  to  a  final  concentration  of  50  mM  (Johnsson 
and  Varshavsky,  1994).  Two  methods  were  used  to  prepare  cell  extracts. 
For  the  GST  pull-down  assays,  cell  extracts  were  prepared  as  described 
above  for  co-immunoprecipitation  immunoblots.  For  the  experiments 
shown  in  Figures  3C,  4B,  5A  and  6,  labeled  cells  were  pelleted,  and 
resuspended  in  0.8  ml  of  buffer  A  (1%  Triton  X-100,  0.15  M  NaCl, 

1  mM  EDTA,  50  mM  Na-HEPES  pH  7.5)  containing  protease  inhibitors, 
and  were  lysed  as  above.  The  extracts  were  centrifuged  at  12  000  g  for 
10  min,  and  supernatants  containing  equal  amounts  of  CCI3COOH- 
insoluble  ^^S  were  used  for  the  GST  assays  described  above  or  for 
immunoprecipitations.  To  detect  Ubrlp  (Figures  2A  and  5A,  lower 
panels),  immunoprecipitations  were  carried  out  with  a  monoclonal  anti- 
FLAG  antibody  conjugated  to  agarose  beads  (Kodak,  Rochester,  NY), 
followed  by  SDS~8%  PAGE.  For  the  co-immunoprecipitation  of  Leu- 
pgal  with  Ubrlp  (Figure  5 A,  top  panel),  supernatants  were  incubated 
with  a  monoclonal  anti-pgal  antibody  (Promega,  Madison,  WI),  followed 
by  SDS-8%  PAGE.  For  measurements  of  X-pgal  stability  using  the 
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UPR  technique  (Figures  3C,  4B  and  6),  a  mixture  of  an  anti-HA 
antiserum  (Babco,  Berkeley,  CA)  and  the  monoclonal  anti-pgal  antibody 
was  incubated  with  -labeled  cell  extracts.  Immunoprecipitated  proteins 
were  separated  by  SDS-12%  PAGE.  For  detection  of  the  in  vivo 
ubiquitylation,  cells  labeled  for  6  min  were  lysed  using  the  SDS-boiling 
method  (Hochstrasser  and  Varshavsky,  1990).  Briefly,  a  pellet  of  NEM- 
treated  cells  (see  above)  was  mixed  with  an  equal  volume  of  the  2X 
SDS  buffer  at  100°C  (2%  SDS,  30  mM  dithiothreitol,  90  mM  Na- 
HEPES  pH  7.5)  plus  50  mM  NEM,  and  incubated  at  100°C  for  5  min. 
The  extracts  were  diluted  10-fold  with  buffer  A  plus  protease  inhibitors 
and  50  mM  NEM.  CClsCOOH-insoluble  was  determined,  and 
samples  containing  equal  amounts  of  were  immunoprecipitated  with 
the  monoclonal  anti-pgal  antibody,  followed  by  SDS-6%  PAGE. 
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The  split-ubiquitin  technique  was  used  to  detect  transient  protein  interactions  in  living 
cells.  Nub/  the  N-terminal  half  of  ubiquitin  (Ub),  waq fused  to  Sec62p,  a  component  of  the 
protein  translocation  machinery  in  the.  endQplasmiq  reticulum  of  Saccharomyces  cerevisiae. 
Cub,  the  C-terminal  half  of  Ub,  was  fused  to  the  C  terminus  of  a  signal  sequence.  The 
reconstitution.o£a.quasi-native  Ub  structure  from  the  two  halves  of  Ub,  and  the  resulting 
cleavage  by  Ub-specific  proteases  atrthe  C  terminus  of  Cub,  serve  as  a  gauge  of  proximity 
between  the  two  test  proteins  linked  to  Nub  ^ub-  Using  this  assay,  we  show  that 
Sec62p  is  spatially  close  to  the  sigrial^sequence  of  the  prepro-a-factor  in  vivo.  This 
proximity  is  confined  to  the  nascentipdJypqptide  chain  immediately  following  the  signal 
sequence.  In  addition,  the  extent  6f  proximity  depends  on  the  nature  of  the  signal 
sequence.  Cub  fusions  that  bore  the  sij^al  sequence  of  invertase  resulted  in  a  much  lower 
Ub  reconstitution  with  Nub-Sec62p  than  otherwise  identical  test  proteins  bearing  the 
signal  sequence  of  prepro-a-factor.  An  inactive  derivative  of  Sec62p  failed  to  interact 
with  signal  sequences  in  this  assay.  These  in  vivo  findings  are  consistent  with  Sec62p 
being  part  of  a  signal  sequence-binding  complex. 


INTRODUCTION 

A  critical  step  during  the  translocation  of  a  protein 
across  the  membrane  of  the  endoplasmic  reticulum 
(ER)  is  the  interaction  between  the  signal  sequence  of 
a  nascent  pol5q)eptide  and  its  receptors  (Walter  et  ah, 
1981;  Gilmore  and  Blobel,  1985;  Walter  and  Johnson, 
1994).  A  stretch  of  8  to  12  hydrophobic  residues,  often - 
at  the  N  terminus  of  a  prot^,  comprises  a. signal... 
sequence  that  is  sufficient  to  initiate  the  protein's  ' 
translocation  into  the  endoplasmic  reticulum  (ER) 
(Rapoport  et  ah,  1996).  To  be  compatible  with' a  high 
flux  of  polypeptides  through  a  limited  nurnber.of 
translocation  channels  in  the  ER  membrane,  the  inter-:  *' 
action  between  the  signal  sequence  and  its  receptors 
has  to  be  short  lived.  Its  transient  nature  makes  such  a 
receptor-ligand  interaction  difficult  to  study,  espe¬ 
cially  in  living  cells.  The  approaches  used  for  the 

*  Corresponding  author.  E-mail  address;  johnsson@mpiz- 
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analysis  of  protein  translocation  in  cell-free  systems 
circumvent  the  transience  of  the  signal  sequence-re¬ 
ceptor  interaction  by  pausing  or  stopping  the  synthe¬ 
sis  of  a  nascent  polypeptide  chain  at  different  stages  of 
its  movement  to  and  across  the  ER  membrane  (Krieg  et 
a\.,  1986;  Kurzchalia  et  al,  1986;  Connolly  et  al,,  1989). 
Given  these  constraints,  it  is  essential  to  verify  in  vivo 
the  models  derived  from  in  vitro  studies.  The  ability  to 
analyze  early  translocation  events  in  vivo  should  also 
be  important  for  defining  the  immediate  environment 
of  the  nascent  chain  on  its  path  from  the  ribosome  to 
the  ER  membrane. 

Most  of  the  current  methods  for  detecting  protein  in¬ 
teractions  in  vivo  either  do  not  operate  at  the  ER  mem¬ 
brane  or  are  unable  to  detect  a  transient  proximity  be¬ 
tween  proteins  (Fields  and  Song,  1989;  Aronheim  et  al, 
1997;  Rossi  et  al,  1997;  Miyawaki  et  al,  1997).  In  the 
present  work,  we  show  that  the  previously  developed 
split-ubiquitin  (split-Ub)  technique,  also  called  USPS 
(Ub/split/protein/sensor)  (Johnsson  and  Varshavsky, 
1994a,  199!^,  is  capable  of  detecting  a  transient  in  vivo 
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Figure  1.  The  spUt;LIb_tedmiqw..and  its  application  to  the  analysis 
of  protein  translocation.  (A)  N„t,  and  C„b  are  linked  to  the  interact- 
i%  proteins  X  and  Y.  The  X-Y  complex  brings  and  C^i,  into 
cl^e  proximity.  N„b  and  C„b  reconstitute  a  quasi-native  Ub  moiety, 
VN^ch  is  cleaved  by  the  UBPs,  yielding  the  free  reporter  R  (Johnsson 
^594a).  (B)  Uslrjg  split-Ub  to  monitor  the  pfbximifv” 
b^eefi  a  secretory  protein  _and_a  component  of  the  translocatidn 
mSchineiy.  A  signal  sequence-bearing  C^b  fusion  (SS-Y-C„b-R)  and 
a  blub  fusion  (N„b-X)  are  coexpressed  in  a  cell.  Pathway  1:  when  N„b 
is  Hnked  to  a  protein  not  involved  in  the  targeting  for  translocation, 
the  uncleaved  (except  for  the  signal  sequence  SS)  Y-C^b-R  enters  the 
ER.  Pathway  2:  when  N„b  is  linked  to  a  protein  involved  in  the 
targeting  for  translocation  the  signal  sequence  of  the  SS-Y-Qb-R 
brings  N^b  and  C^b  into  close  contact.  As  a  result,  some  oAhe 
SS-Y-Cub-R  and  N^b-X  molecules  Interact  to  form  the  quasi-native 
Ub,  yielding  the  free  reporter  R  in  the  cytosol. 

interaction  between  polypeptides.  The  split-Ub  method 
is  based  on  the  ability  of  and  the  N-  and 
C-terminal  halves  of  Ub,  to  assemble  into  a  quasi-native 
Uh.  Ub-specific  proteases  (UBPs),  which  are  present  in 
all  eukaryotic  cells,  recogruze  the  reconstituted  Ub,  but 
ndt  its  halves,  and  cleaveTHedJb  moiety  off  a  reporter 
p^tein  that  had  beerniinkeittoihe  C  terminus  of  0^^. 
T^e  liberation  of  thejregg^^ggaes  as  a  readout  indi- 
ca^g  the  reconstitution  oLUhJThe  assay  is  designed  in 
a  |vay  that  prevents  efKcknt  association  of  N^b  and  Qb 
bg  themselves,  but  allows  iFif  ' the  two  Ub  halves  are 
s^arately  linked  to  prqteii^  that  interact  in  vivo  (Figure 
M).  The  spUt-Ub  assay'has  been  shown  to  detect  the  In 
vivo  dimerization  of  a  leucine  zipper-containing  domain 
of  the  Gcn4p  transcriptional  activator,  and  the  in  vivo 
interaction  between  two  subunits  of  the  oligosaccharyl- 
transferase  complex  0ohnsson  and  Varshavsky,  1994a; 
Stagljar  et  al,  1998). 

In  the  present  work,  we  focus  on  the  interaction 
between  Sec62p  of  the  yeast  Saccharomyces  cerevisiae 


and  proteins  bearing  two  different  signal  sequences. 
Extensive  evidence  indicates  that  Sec62p  is  a  compo¬ 
nent  of  the  ER  translocation  machinery  (Deshaies  and 
Schekman,  1989;  Rothblatt  et  al,  1989;  Miisch  et  al, 
1992).  Sec62p  is  a  part  of  the  tetrameric  Sec62/63 
complex  that  also  contains  SecZlp  and  Sec72p  (De¬ 
shaies  et  al,  1991;  Feldheim  and  Schekman,  1994). 
Sec62/63p  can  be  isolated  as  a  tetramer,  or  as  a  part 
of  a  larger  assembly,  the  heptameric  Sec  complex 
(Panzner  et  al,  1995).  In  addition  to  the  Sec62/63 
complex,  the  heptamer  contains  the  trimer  of  Sec61p. 
This  trimer  (Sec61p,  Ssslp,  Sbhlp  in  yeast;  Sec61a, 
Sec61j6,  Secbly  in  mammals)  forms  the  aqueous  chan¬ 
nel  through  which  a  polypeptide  chain  is  translocated 
across  the  ER  membrane  (Simon  and  Blobel,  1991; 
Gorlich  et  al,  1992;  Crowley  et  al,  1993, 1994;  Mothes 
et  al,  1994;  Hanein  et  al,  1996;  Beckmarm  et  al,  1997). 

The  role  of  the  Sec62/63  tetramer  is  less  well  de¬ 
fined.  Crpss-linking  and  reconstitution  experiments  in 
vitro  have  shown  that  Sec62p  is  close  to  the  nascent 
polypeptide  chain  before  the  initiation  of  its  translo¬ 
cation  (Miisch  et  al,  1992;  Lyman  and  Schekman,  1997; 
Matlack  et  al,  1997).  One  important  role  of  Sec63p  is  its 
ability  to  recruit  the  Hsp70-type  protein  Kar2p  of  the 
ER  lumen  to  the  vicinity  of  a  translocating  polypeptide 
(Brodsky  and  Schekman,  1993;  Lyman  and  Schekman, 
-1997).  The^ch2/63-complex  is  essential  for  the  post-" 
translationaP  translocation  of  proteins  in  reconstituted 
vesicle  preparations  (Panzner  et  al,  1995).  Genetic 
analysis  supports  this  conclusion,  by  showing  that  the 
tetrameric  Sec62/ 63  complex  is  involved  in  the  trans¬ 
location  of  proteins  whose  targeting  to  the  ER  mem¬ 
brane  is  not  abolished  by  the  loss  of  the  signal  recog¬ 
nition  particle  (SRP)  (Ng  et  al,  1996).  However,  it  is 
less  clear  whether  the  Sec62/ 63  complex  is  the  recep¬ 
tor  for  the  signal  sequences  of  those  proteins. 

In  the  present  work,  we  demonstrate  the  ability  of 
the  split-Ub  assay  to  detect  transient  protein  interac¬ 
tions  in  living  cells.  We  show  that  the  assay  can  mon¬ 
itor  a  close  proximity  between  Sec62p  and  a  segment 
of  the  nascent  chain  of  a  signal  sequence-bearing  pro¬ 
tein.  The  apparent  extent  of  this  proximity  is  influ¬ 
enced  by  the  nature  of  the  signal  sequence  and  the 
position  of  G^b  in  the  nascent  polypeptide.  Our  anal¬ 
ysis  yields  a  .crude  map  of  the  environment  of  the 
nascent  chain  during  its  targeting  to  and  translocation 
across  the  ER  membrane.  Taken  together,  these  find¬ 
ings  are  the  first  in  vivo  evidence  that  Sec62p,  a  com¬ 
ponent  of  the  translocation  machinery  in  the  endo¬ 
plasmic  reticulum,  is  a  part  of  a  signal  sequence- 
binding  complex. 

MATERIALS  AND  METHODS 
Construction  of  Test  Proteins 

The  Cub  fusions  8-13  (Figure  2)  were  derived  from  the  construct  I  of 
Johnsson  and  Varshavsky  (1994a),  which  encoded  Ub-DHFR-ha  and 
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j- Figure  2.  and  Cub  test  fusions.  (A)  N^b  (residues  1-36.  of  Ub)  was  fused  to  the  N  terminus  of  either  a  transmembrane  protein  (constructs 
^  1-6)  or  a  cytosolic  protein  (construct  7).  The  N  termini  of  all  proteins  are  located  in  the  cytosol.  The  orientations  and  the  numbers  of 
I'membrane-spanning  regions  (shaded  boxes)  were  derived  from  the  published  studies  of  these  proteins,  except  for  Stel4p,  for  which  the  exact 
I  number  of  the  domains  and  ^e  localization  of  the  C  terminus  a^.  npt  yet  known.  The  N^g  fusions  1-5  are  located  in  the  membrane  of  the 
^'ER;  the  N^b  fusion  6  resides  in  the  membrane  of  the  early  GolgiflTfe  N^b  fusion  2  is  a  Sec62p  derivative  lacking  the  C-terminal  60  residues. 
’  The  N^b  hision  7  contains  the  full-length  triosephosphate  isoihefase ’(N^b-Tpilp)-  (B)  C^b  fusions.  The  C^b  fusions  8  and  9  contain  the  signal 
sequence  of  the  prepro-a-factor  (shaded  boxes),  followed  by  either  37  (construct  8)  or  65  residues  (construct  9)  of  the  mature  a-factor  sequence 
(striped  boxes)  and  a  7-residue  linker  sequence  (not  shown).  Cub  fusions  10-13  contain  the  signal  sequence  of  the  Suc2p  invertase  (dark  boxes) 
followed  by  23  (construct  10),  33  (construct  11),  59  (construct  12),  or  518  residues  (construct  13)  of  the  mature  sequence  of  invertase  (open 
boxes)  and  a  7-residue  linker  sequence  (not  shown).  The  Cub  fusion  14  contains  the  complete  sequence  of  S.  cerevisiae  triosephosphate 
isomerase  (Tpilp)  followed  by  a  17-residue  linker  peptide  and  Cub-  The  Cub  fusions  15  and  16  are  the  signal  sequence-lacking  coimterparts 
of  the  fusions  10  and  12.  Cub  is  always  followed  by  a  reporter  protein.  The  reporter  is  DHFR-ha  or  Ura3p  for  the  Cub  fusions  8-13,  and 
DHFR-ha  for  the  Cub  fusions  14,  15,  and  16. 


contained  a  BamHl  site  at  the  amino  acid  position  36  of  Ub,  and 
from  the  previously  described  Ub  translocation  constructs  I,  VI,  IX, 
:;X,  XXIII,  and  XXV  (Johnsson  and  Varshavsky,  1994b).  The  above 
BamHl  site  of  the  Ub-DHFR-ha  construct  I  was  fused  to  a  linker 
sequence  in  which  a  5'-Sfl/I  site  allowed  the  in-frame  insertion  of  an 
Eagl-Sali  fragment  containing  the  promoter,  the  signal  sequence, 
and  a  portion  of  the  mature  sequence  of  the  corresponding  Ub 
H  fusions.  The  newly  introduced  sequence  was  G  TCG  ACC  ATG 
^TCG  GGG  GGG  ATC  CCT.  Tl\e  last  three  triplets  encode  r^dues;; 
tl35,  36,  and  37  of  Ub  (the  beginning _of  Cub)-  The  underlined  se- 
^quences  are  the  SaR  and  BamHl  sites,  respectively.  The  final  con- 
f  structs  were  in  the  single-copy  plasmids  pRS314  or  pRS315  (Sikorskl 
land  Hieter,  1989).  Expression  of  the  Cub  fusions  bearing  Dha- 

4  (DHFR-ha)  as  a  reporter  was  mediated  by  the  Padhi  promoter,  '' 

5  except -for  the  Cub  fusion  14,  which  was  expressed  from  the  P^p^' 
5-prbmbter.  The  same  promoter  was  used  for  expressing  the  Uf a3p-  i 

based  Cu^  fusions. 

The  Cub  fusions  15  and  16  (Figure  2)  were  derived  from  constructs 
10  and  12  by  deleting  the  Hindlll  fragment  spanning  the  first  four 
codons  of  the  SUC2  ORF  and  a  short  portion  of  the  polylinker 
; sequence  between  the  3'-end  of  the  Padhi  promoter  and  the  SUC2. 
ORF.  As  a  result,  the  translation  of  Cub  fusions  15  and  16  began  at 
the  first  codon  of  the  mature  invertase,  skipping  its  signal  sequence. 
The  Cub  fusion  14  (Figure  2)  was  produced  through  an  in-frame 
fusion  of  a  PCR  fragment  containing  the  complete  TPIl  coding 


sequence  and  Cub'Dha.  The  sequence  between  TPll  and  Cub  is  as 
follows:  AAC  GGG  TCG  ACC  GAC  TAC  AAG  GAC  GAC  GAT 
GAC  AAG  GGC  TCG  ACC  ATGTCG  GGG  GGG  ATC  CCT.  The 
underlined  sequences  indicate,  respectively,  the  last  codon  of  TPIl 
and  the  first  three  codons  of  C.b- 
A  fragment  encoding  Nub“Sec62p  was  constructed  using  PCR 
amplification  of  a  1050  base  pair  (bp)  fragment  containing  the  SEC62 
ORF.  PCR  introduced  a  BamHl  site  and  a  linker  sequence  in  front  of 
the  start  codon  of  SEC62  and  an  Xhol  site  173  bp  downstream  of  the 
stop  codon.  The  Pcupi"Nub  modules  were  cloned  as  BamHl  frag¬ 
ments  in  frame  with  the  SEC62  ORF.  The  sequence  between  Nub 
SEC62  is  GGG  ATC  CCT  TCT  GGG  ATG.  The  first  three  codons 
encode  residues  35, 36,  and  37  of  Nub/  followed  by  the  Gly-Ser  linker 
and  the  start  codon  of  SEC62.  The  BamHl  site  is  underlined.  The 
_  final  constructs  resided  in  pRS316  or  pRS313.  Nub-TPIl,  Nub-SED5, 
Nub-ST£14  (a  gift  from  N.  Lewke),  and  Nub-Sec62(AC60)-Dha  were 
constructed  similarly  to  Nub-5EC62.  With  the  exception  of  Nub“ 
Sec62(AC60)-Dha,  which  was  placed  in  pRS316  and  pRS313,  all  of 
these  fusions  resided  in  pRS314.  The  linker  connecting  codon  35  of 
Nub  the  first  codon  of  a  linked  gene  was  GGG  ATC  CCT  GGG 
GAT  ATG  for  Nub-TP/1  and  Nub-SEDS,  and  GGG  ATC  CCT  GGG 
GAT  CAC  for  Nub-STEI4.  Underlined  are  the  BamHl  site  and  the 
first  codon  of  the  linked  gene.  The  sequence  GGG  TCG  ACC  TTA 
ATG  CAG  AG  A  TCT  GGC  ATC  ATG  GTT  connected  the  last  codon 
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Table  1.  Yeast  strains 


YPH500 

JD53 

RSY529 

NJY51 

NJY62-1 

NJY73-I 

NJY74-I 

NJY61-I 

NJY61-A 

NJY61-G 


Relevant  genotype 

MATa  ade2~101  his3-A200  Ieu2-M  lys2-801  trpl-A63  ura3-52 

MATa  his3-A200  leii2-3,112  lys2-801  trpl-A63  ura3-52 

MATa  his4  leii2~3,112  ura3~52  sec62-l 

MATa  ade2-101  his3-A200  Ieu2-M  lys2-801  trpl-A63  ura3-52 

NUB-BOSl::pRS306 

MATa  his3-A200  leu2-3,112  lys2~801  trpl-A63  ura3-52 
NUB~SEC62::pRS306 

MATa  his3-A200  leu2-3,112  lys2-801  trpl-A63  ura3~52 
NUB-BOSl::pRS303 

MATa  his3-A200  leu2~3J12  lys2-801  trpl-A63  ura3~52 
NUB-BOSl::pRS306 

MATa  his3~A200  leu2-3,112  lys2-801  trpl-A63  ura3-52 
NUB-SEC61::pRS304 

MATa  his3-A200  leu2-3,112  lys2-801  trpl-A63  ura3-52 
NUA-SEC61::pRS304 

MATa  his3-A200  leu2-3,112  lys2-801  trpl-A63  ura3-52 
NUG-SEC61::pRS304 


Source/ comment 

Sikorski  and  Hieter  (1989) 
Dohmen  et  al  (1995) 

Deshaies  and  Schekman  (1989) 
Derivative  of  YPH500 

Derivative  of  JD53 

Derivative  of  JD53 

Derivative  of  JD53 

Derivative  of  JD53 

Derivative  of  JD53 

Derivative  of  JD53 


of  SEC62  in  Nub-Sec62(AC60)-Dha  (codon  223,  underlined)  to  the 
first  two  codons  of  DHFR  (underlined). 

N^^rBOSl  was  constructednn^rr  by  PCR  amplification,  with 
two  synthetic  oligos  and  yeast  genomic  DNA  as  a  template,  yielding 
a  258-bp  fragment  containing  the  first  229  bp  of  the  BOSl  ORR 
Upstream  of  the  BOSI-ATG  -was  a  short  linker  sequence  and  a 
— si^^  to  allow  m-frame  fusion  .of  the  Pcupi  promoter-N^i, 
-module.  The  sequence  belween^N^^t  and  BOSl  reads:  GGG  ATC 
CGT  CCA  GGA  ATGrTh^irstTouf  triplets  encode  residues  35,  36, 
37,  and  38  of  followed  by  the  Gly  codon  and  the  start  codon  of 
BOSl.  The  BamHl  site  is  underlined.  The  3 '-region  of  the  resulting 
fragment  terminated  in  a  SaU.  site  for  insertion  into  the  integrating 
vectors  pRS306  or  pRS303.  The  vector  was  cut  at  the  unique  EcoRI 
site  in  the  ROS2 -containing  fragment  and  transformed  into  S.  cer- 
evisiae  strains  YPH500  and  JD53  to  produce,  through  homologous 
recombination,  the  integrated  cassette  that  expressed  N^b-Boslp 
from  the  Pcup:  promoter.  The  presence  of  the  desired  gene  fusion 
and  the  absence  of  wild-type  BOSl  were  verified  by  PCR, 

An  integrated  copy  of  was  produced  by  ampli¬ 

fying  the  first  438  bp  of  the  SEC62  ORF,  and  then  cloning  it,  using 
the  BaniHl  and  EcoRI  restriction  sites,  in  frame  behind  the  pRS306- 
Pcwi-Nub  cassette.  A  unique  Aflll  site  in  the  SEC62  ORF  was  used 
to  linearize  the  plasmid  for  transformation  and  integration  at  the  S. 
cerevisiae  SEC62  gene,  yielding  the  strain  NJY62-I.  The  N-terminal 
147-residue  fragment  of  Sec62p  that  was  coexpressed  with 
Sec62p  in  the  resultingjstrairiihas^previously  been  shown  to  be 

was  constructed  by  targeted  i^  of  'a  N^b-SEC63-^ontai^M^ 

fragment  into  SEC61  -oLthe^S^reofsige-stfain  JD53  (Table  1).  Spe¬ 
cifically,  a  fragment  containing-the-fet-875  bp  of  the  SEC61  ORF 
was  amplified  by  PCR  and  inserted  downstream  of  the  pRS304-  or 
pRS303-based  Pcupr^ub^  cassefle,  using  the  flanking  BamHl  and 
EcoRI  sites.  The  linker  sequenceJoeteeruN^b  and  SEC61  was  GGG 
ATC  CCT  GGG  TCT  GGG  ATG~ -Underlined  are  the  BamHl  site  and 
the  first  codon  of  SEC61.  For  targeted  integration,  the  plasmid  was 
linearized  at  the  unique  Stul  site  in  the  SEC61  ORF  to  create  the 
yeast  iSTfYEl-I.  A  detailed  description  of  the  NJY61  strains  (Table  1) 
will  be  presented  elsewhere  (Wittke  and  Johnsson,  unpublished 
data). 

All  of  the  Pcupi  promoter-controlled  ORFs  were  expressed  under 
noninducing  conditions  (no  copper  added  to  the  medium),  except  in 
the  experiment  shown  in  Figure  5B,  where  cells  were  incubated  in 
the  presence  of  0.1  mM  CUSO4. 


Immunoblotting 

Proteins  fractionated  by  SDS-12.5%  PAGE  were  electroblotted  onto 
nitrocellulose  (Schleicher  &  Schuell,  Dassel,  Germany)  or  polyvi- 
nylidene  difluoride  (Machery-Nagel,  Duren,  Germany)  membranes, 
using  the  semidry  transfer  system  (Hoeffer  Pharmacia  Biotech,  San 
Francisco,  CA).  Blots  were  incubated  with  an  anti-ha  monoclonal 
antibody  (Babcd,  Richmond,  CA)  and  visualized'using  horseradish 
peroxidase-coupled  goat  anti-mouse  antibody  (Bio-Rad,  Hercules, 
CA),  the  chemiluminescence  detection  system  (Boehringer,  Mann¬ 
heim,  Germany),  and  x-ray  films.  Where  indicated,  quantification 
was  performed  using  the  Lumi  Imager  system  (Boehringer). 

Pulse-Chase  Analysis 

Yeast-rich  (YPD)  and  synthetic  minimal  media  with  2%  dextrose 
(SD)  were  prepared  as  described  previously  (Dohmen  et  al,  1995).  S. 
cerevisiae  cells  expressing  the  N„b  and  C„b  fusions  were  grown  at 
30°C  in  10  ml  of  SD  medium  without  externally  added  copper  to  an 
ODgoo  of  ~1  and  labeled  for  5  min  with  Redivue  Promix-f^^S] 
(Amersham,  Buckinghamshire,  United  Kingdom),  followed  (either 
directly  or  after  a  chase)  by  immunoprecipitation  with  the  anti-ha 
monoclonal  antibody,  essentially  as  described  by  Johnsson  and 
Varshavsky  (1994a,b).  The  EndoH  analysis  of  glycosylated  proteins 
was  carried  out  as  described  by  Orlean  et  al  (1991).  Samples  were 
concentrated  before  SDS-12.5%  PAGE  by  precipitation  with  chloro¬ 
form/  methanol.  Gels  were  fixed  and  enhanced  for  fluorography. 
For  quantitative  analysis,  a  dried  gel  was  exposed  and  scanned 
using  a  ITiosphorlmager  (Molecular  Dynamics,  Sunnyvale,  CA). 


RESULTS 

Experimental  Strategy 

The  use  of  split  Ub  to  monitor  the  proximity  between 
the  proteins  X  and  Y  requires  the  construction  of  two 
"complementary"  fusion  proteins.  One  fusion  bears 
N^b  (see  INTRODUCTION)  linked  to  X  (N^b-X)  and 
the  other  bears  C^b  linked  to  both  Y  and  a  reporter 
protein  R  at  the  C  terminus  of  C^b  XY-C^b“R)-  The 
liberation  of  the  reporter  through  the  Ub-dependent 
cleavage  by  UBPs  indicates  the  in  vivo  reconstitution 


332 


Molecular  Biology  of  the  Cell 


? 


of  a  quasi-native  Ub  from  and  C^b-  In  the  split-Ub 
assay,  the  efficiency  of  cleavage  at  the  C  terminus  of 
C^b  in  Y-Cub“R  is  measured  relative  to  the  efficiency  of 
cleavage  observed  with  selected  reference  (control) 
proteins  (Figure  1). 

To  monitor  protein  interactions  during  translocation 
of  a  protein  across  the  ER  membrane,  N^b  fused  to 
the  N  terminus  of  a  membrane  protein  that  is  a  part  of 
the  translocation  machinery  (Figure  1).  Owing  to  the 
constraint  of  the  assay,  which  requires  the  cytosolic 
location  of  the  reconstituted  Ub,  the  N  terminus  of  this 
membrane  protein  must  be  located  in  the  cell's  cy¬ 
tosol  Sec62p  has  an  N-terminal  cytosolic  domain  of 
158  residues,  which  is  followed  by  two  membrane- 
spanning  segments  and  a  C-terminal  segment  also 
facing  the  cytosol  (Deshaies  and  Schekman,  1990).  N^b 
was  therefore  fused  to  the  N  terminus  of  Sec62p, 
yielding  Nub“Sec62p.  C^b  was  sandwiched  between 
the  56  N-terminal  residues^  of  the  precursor  of  S.  cer- 
evisiae  a-factor  pheromone*  (prepro-a-factor)  and  the 
ha'^pitope^agged  mouse  dihydrofolate  reductase 
(DHFR-ha;  denoted  as  Dha)  as  a  reporter  protein, 
"yielding  Mfu^C.i^-Dha  (Figure  2).  The  cleavage  of  the 
Cub-containing  fusion  at  the  C^b-Dha  junction  was 
detected  with  a  monoclonal  anti-ha  antibody. 


Split-Ub  Detects  a  Proximity  between  a 
Translocating  ProfRn  and  Sec62p 

We  first  verified  that  Mfa37-Cub-Dha  could  be  translo¬ 
cated  across  the  ER  membrane  and  that  the  N-terminal 
extension  of  Sec62p  with  N^b  did  not  interfere  with 
the  Sec62p.  function  in  translocation.  After  a  5-min 
pulse  of  wild-type  S.  cerevisiae  with  ^^S-methionine, 
the  labeled  Mfagy-Cub-Uha  was  immunoprecipitated 
as  a  glycosylated  and  undipped  fusion  (Figure  3A). 
Thus,  Mfa37-Cub-Dha  could  indeed  be  translocated 
into  the  lumen  of  ER.  Introduction  of  the  same  Mfa37- 
Cub'Uha  construct  into  the  yeast  strain  RSY529,  which 
carries  a  temperature-sensitive  (ts)  variant  of  Sec62p 
(Rothblatt  et  ah,  1989),  confirmed  the  severe  transloca¬ 
tion  defect  of  this  strain.  About  50%  of  the  pulse- 
labeled  Mfa37-Cub-E)hFentered  the  lumen  of  the  ER  in 
this  strain  at  the  semip^rmissive  temperature  of  30°C, 
while  the  rest  remainedJn-^-cytosol  (Figure  3A). 
Thus,  the  translocation  of  Mfa37-Cub“E)ha  depends  on 
Sec62p.  This  made  it  posdble  to  determine  whether 
Nub-Sec62p  is  functionally  active.  The  test  utilized 
Nug-Sec62p,  in  which  the  N-terminal  half  of  Ub  con¬ 
tained  Gly-13  instead  of  wild-type  Ile-13.  This  deriv¬ 
ative,  denoted  as  N^g,  has  a  lower  affinity  for  C^b  than 
the  wild-type  N^b  Qohnsson  and  Varshavsky,  1994a). 
We  chose  Nug"'Sec62p  for  this  experiment  to  minimize 
the  reconstitution  of  the  Ub  moiety  through  interac¬ 
tions  between  Nub-Sec62p  and  potentially  arrested 
molecules  of  MfQ:37-Cub-C)ha,  which  might  be  localized 
in  the  cytosol.  Plasmids  expressing  Nug“Sec62p  and 


Interaction  between  Signal  Sequence  and  Sec62p 


a  b  c  d  e  f  g  h 


—  Sec62p  ^^^^^H  b 
Nut)-Sec62p  c 

Sec62(AC60)^ha  cl 

Nub-Sec62(^C60)l^l^a  |||||||||m  e 


Figure  3.  Sec62p  is  close  to  the  signal  sequence  of  the  a-factor 
precursor.  (A)  S.  cerevisiae  cells  expressing  MfasT-C^b-Dha  (construct 
8;  Figure  2)  were  labeled  for  5  min  with  ^^S-methionine.  The  ex¬ 
tracted  proteins  were  immunoprecipitated  with  anti-ha  antibody, 
followed  by  a  mock  treatment  (lanes  a,  c,  e,  and  g)  or  the  treatment 
with  EndoH  (lanes  b,  d,  f,  and  h),  and  SDS-PAGE.  The  results  with 
cells  coexpressing  N^g-  or  Nub-Sec62p  are  shown  in  lanes  c  and  d 
and  g  and  h,  respectively.  The  analysis  was  performed  with  N^g’ 
Sec62p  in  the  S.  cerevisiae  mutant  RSY529  carrying  a  ts  allele  of 
SEC62  (lanes  a-d)  or  with  Nub-Sec62p  in  the  wild-type  yeast  (lanes 
e-h).  Number  8  (following  the  numbering  of  the  constructs  in 
Figure  2)  oh  the  right  indicates  the  positions  of  uncleaved  Mfa37- 
Cub-E)ba  and  its  glycosylated  forms.  An  asterisk  denotes  an  unre¬ 
lated  yeast  protein  that  cross-reacts  with  the  anti-ha  antibody.  (B) 
Nub-Sec62p  encodes  a  functionally  active  protein.  RSY529  cells  car¬ 
rying  an  empty  plasmid  (a),  Sec62p  (b),  Nub-Sec62p  (c), 
Sec62(AC60)Dha  (d),  or  _Nub-Sec62(AC60)Dha  (e)  were  spotted  on 
minimal  media,  and  grown  for  2  d  at  30°C  (semipermissive  temper¬ 
ature  for  unmodified  RSY529). 


Mfa37-Cub-E)ha  were  cotransformed  into  RSY529  cells 
and  assayed  at  30°C.  As  in  wild-type  cells,  only  trans¬ 
located  Mfa37-Cub-E)ha,  but  virtually  no  free  Dha  or 
nontranslocated  MfQ:37-Cub-Dha,  was  detected  after 
immunoprecipitation  and  EndoH  treatment  of  the 
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cells  that  had  been  labeled  for  5  min  with  ^^S-methi- 
onine  (Figure  3A). 

To  test  Nui,-Sec62p  directly  (N^b  is  the  wild-type  half 
of  Ub,  containing  lie  at  position  13),  we  examined  its 
ability  to  complement  the  growth  defect  of  RSY529 
cells.  RSY529  cells  expressing  Nub-Sec62p  were  found 
to  grow  at  the  semipermissive  temperature  of  30°C,  in 
contrast  to  congenic  cells  carrying  a  control  plasmid 
(Figure  3B),  To  verify  that  the  suppression  of  the  ts 
phenotype  was  not  due  to  the  initiation  of  translation 
from  the  first  (internal)  ATG  codon  of  Sec62p  within 
theN„b  -Sec62p  fusion,  the  rescue  experiment  was  suc¬ 
cessfully  repeated  with  the  otherwise  identical  deriv¬ 
ative  of  Nub-Sec62p  that  lacked  the  first  ATG  of  SEC62 
(our  unpublished  results). 

A  significant  amount  of  free  Dha  was  generated 
when  Mfa37-Cut,“Dha  was  expressed  (in  either  wild- 
;type  or  RSY529  cells)  together  with  Nub-Sec62p,  which 
contained  the  wild-type  half  of  Ub  (Figure  3A  and  our 
junpublished  results).  .We_cqncluded  that  .Sec62p  is 
jclose  to  the  nascent  polypeptide  chain  during  its  trans-. 
location  into  the  ER.  .The..cleavage  at  the  C  terminus  of 
^ub  requires  its  interaction  with  N^b  and  depends  on 
the  presence  of  UBPs  0ohnsson  and  Varshavsky, 
i994a).  Since  UBPs  have  previously  been  shown  to  be 
absent  from  the  ER  (Johnsson  and  Varshavsky,  1994b), 
ihe-free  Dha  moiety  had-tobe  produced  in  the  cytospl. 
fractionation  experiments  corifirmed  that- free-; DHa 
was  absent  from  membrane-enclosed  compartments 
in  whole-cell  extracts  (our  unpublished  results).’ An 
entirely  independent  evidence  for  this  conclusion  was 
produced  by  replacing  Dha  in  Mfa37-C^b-Dha  with 
Ura3p  as  the.  reporter  moiety.  Ura3p  confers  the  Ura+ 
phenot5q?e  on  uraSL  cells  only  if  Ura3p  has  access  to 
the  cytosol  (Johnsson  and  Varshavsky,  1994b).  In  our 
tests,  the  cytosolic  Ura3p  was  produced  only  if  Mfa37- 
C„b-Ura3p  was  coexpressed  with  Nub-Sec62p  (com¬ 
pare  A  and  B  in  Figure  7),  in  agreement  with  the  other 
evidence  (see  above)  that  the  cleavage  at  the  C„b- 
protein  junction  takes  place  exclusively  in  the  cytosol. 

The  transient  nature  of  the  proximity  between 
Sec62p  and  the  nascent  chain  of  a  translocated  protein 
was  indicated  by  the  near-absence  of  the  released  Dha 
Aoie^  bMfa37-C„b-Dba  was  coexpressed  with  either 
M„g-Sec62p  or  Nu3-Sec6'2p  instead  of  N„b-Sec62p  (^(^3 
denotes  Ala  at  position  13  of  N^t);  by  contrast„.the 
^me  experiment  with  Nub-Sec62p  resulted  in  a  signif-:' 
i|ant  cleavage  of  Mfa37-C^b-Dha  (Figures  3 A  and;6C).. 
devious  work  (Johnsson  and  Varshavsky,  1994a)  hg?  . 
E^own  that  and  N„g  can  induce  significant  Lib 
reconstitution  when  either  of  them  and  C^b  are  linked 
to  polypeptides  that  form  a  stable  (long-lived)  com- 
plex  m  a  cell.  In  summary,  the  observed  absence  of 
significant  Ub  reconstitution  with  and  (in 
contrast  to  N^b)  interpreted  to  signify  a  close  but 
transient  (short-lived)  proximity  between  Sec62p  and 

^f®37"Cub"Dha . 


Specificity  of  the  Spatial  Proximity  between  a 
Signal  Sequence-bearing  Nascent  Polypeptide  and 
Sec62p 

A  commonly  used  negative  control  in  a  translocation 
assay  is  a  protein  with  a  defective  or  absent  signal 
sequence  (Allison  and  Young,  1988;  Musch  et  al, 
1992).  Such  a  control  is  not  entirely  compatible  with 
spatio-temporal  aspects  of  the  split-Ub  assay.  Specifi¬ 
cally,  a  C^b-fusion  protein  lacking  a  signal  sequence 
accumulates  in  the  cytosol  (where  the  split-Ub  assay 
operates),  whereas  an  analogous  signal  sequence¬ 
bearing  protein  is  continuously  removed  from  this 
compartment.  A  direct  comparison  between  reactions 
that  involve  a  signal  sequence-bearing  polypeptide 
and  its  signal  sequence-lacking  counterpart  requires 
the  abihty  to  compare  the  local  concentrations  of  the 
two  polypeptides  at  the  site  of  translocation.  We  are 
not  aware  of  an  in  vivo  technique  that  would  be 
independent  of  the  split-Ub  assay  and  at  the  same 
time  would  allow  a  measurement  of  these  parameters. 
Therefore,  we  devised  an  alternative  control.  The  ex¬ 
tent  of  cleavage  of  Mfa37-Cub-Dha  at  the  C„b-Dha 
junction  should  reflect  the  time-averaged  spatial  prox¬ 
imity  between  the  nascent  Mfa-chain  and  a  coex¬ 
pressed  Nub-containing  fusion.  By  comparing  the  ex¬ 
tent  of  cleavage  of  Mfq;37-Cub-Dha  in  the  presence  of 
Nub-Sec62p  (Figure  3 A)  with  the  analogous  activity  of 
Nub-fusion  proteins  that  are  not  involved  in  the  ER 
targeting  and  translocation,  we  could  assess  the  spec¬ 
ificity  of  the  reaction  between  Nub-Sec62p  and  Mfa37- 
Cub-Dha. 

Four  Nub-fusion  proteins,  N^b-Boslp,  Nub-Stel4p, 
Nub-SedSp,  and  N^b-Tpilp  were  tested  in  the  split-Ub 
assay  with  MfQ;37-Cub-Dha.  The  expected  intracellular 
locations  of  these  N^b  fusions,  and  their  predicted 
topologies  in  the  membrane  are  shown  in  Figure  2A 
(Shim  et  al,  1991;  Banfield  et  al,  1994;  Sapperstein  et 
al,  1994;  Lewke  and  Johnsson,  unpublished  data).  We 
found  that,  in  contrast  to  Nub-Sec62p,  none  of  the  four 
tested  Nub  fusions  induced  a  significant  cleavage  of 
Mfa37-Cub-Dha  (Figure  4A).  A  small  amount  of  free 
Dha  could  be  detected  in  the  immunoblots  when  N^b- 
Boslp  was  overexpressed.  The  lack  of  significant  Ub 
reconstitution  from  N^b  and  C^b  upon  coexpression  of 
Mfa37-Cub-Dha  and  the  Nub-modified  .ER  membrane 
proteins,  Boslp,  Stel4p  (Figure  4A),  and  Secl2p  (Na- 
kano  et  al,  1988;  our  unpublished  results),  confirmed 
that  the  steady-state  concentration  of  .Mfa37-Cub-Dha 
in  the  cytosol  was  extremely  low. 

To  verify  that  the  observed  absence  of  Ub  reconsti¬ 
tution  (Figure  4A)  was  not  due  to  either  low  concen¬ 
trations  of  the  tested  fusion  proteins  or  reduced  acces¬ 
sibility  of  their  linked  Nub  moieties,  we  compared  the 
activity  of  these  Nub  fusions  toward  a  cytosolic  Cub- 
fusion  protein.  Cub-Dha  was  fused  to  the  C  terminus 
of  the  cytosolic  enzyme  triosephosphate  isomerase 
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Figure  4.  The  in  vivo  proxim-  m 
ity  between  Sec62p  and  Mfagy-  ^ 

C^b-Dha  is  transient  and  spe¬ 
cific.  (A)  Immunoblot  analysis 
of  extracts  of  S.  cerevisiae  coex¬ 
pressing  the  Mfa37-Cub-Dha 
(construct  8;  Figure  2)  and  one 
of  the  following  constructs: 

Nyb“Sec62p,  integrated  (lane  a) 
or  plasmid-borne  (lane  b);  N^b* 

Boslp  (lane  c);  N^b-SteMp  (lane 
d);  Nub-Sed5p(lane  e);  and  N^b* 

Tpilp  (lane  f).  (B)  Immunoblot 
analysis  of  extracts  of  S.  cerevi¬ 
siae  expressing  the  Tpilp-C^b" 

Dha  fusion  (construct  14;  Figure 
2)  alone  (lane  a)  or  together 
with  one  of  the  following  con¬ 
structs:  Nyb-Sec62p,  either  inte¬ 
grated  (lane  b)  or  plasmid- 
borne  (lane  c);  N^b-Boslp  (lane 
d);  N,b-Stel4p  (lane  e);  N,b- 
Sed5p(lane  f);  and  N^b-Tpilp  Q 
(lane  g).  Number  14  on  the  left 

indicates  the  position  of  un-„ . 

cleaved  Tpilp-Cub-Dha.  (C)  S. 
cerevisiae  cells  expressing 
Mfa37-Cub-Dha  (construct  8; 

Figure  2)  together  with  either^*”"' 
the  vector  (lane  a),  N^b'Boslp 
(lane  b),  or  Nub”Sec62p  (lane  c) 
were  labeled  for  5  min  with  ^^S- 
methionine.  The  extracted  pro¬ 
teins  were  immunoprecipitated 
with  anti-ha  antibody  and  ana¬ 
lyzed  by  SDS-PAGE.  (D)  Quan¬ 
titation  of  the  pulse-labeling  ex¬ 
periment  (C)  using  Phosphor- 
Imager.  The  extent  of  Dha  re-  Dhp  — 
lease  in  the  presence  of  N^b" 

Sec62p  was  arbitrarily  set  at 
100.  The  averages  of  three  ex¬ 
periments  are  shown.  Lanes  a, 
b,  and  c  are  the  same  as  in  panel 
C.  (E)  S.  cerevisiae  cells  expressing  MfQ:37-Cub-Dha  together  with  N^b' 
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Sec62p  were  labeled  for  5  min  with  ^^S-methionine  and  chased  for 


5  and  15  min,  followed  by  extraction  of  proteins,  immunoprecipitation  with  anti-ha  antibody,  and  SDS-PAGE. 


(Tpilp),  yielding  Tpil-Cub-Dha  (Figure  2B).  All  of  the 
Nutj-fusion  proteins  in  Figure  2A  induced  a  significant 
release  of  Dha  from  the  test  protein  Tpil-C^b'^^ha 
(Figure  4B).  This  analysis  -  also  suggested  that 
Boslp  was  expressed  to  higher  levels  than  other 
fusions. 

To  quantify  the  relative  proximities  of  N^,b“B^sip 
and  Nub-Sec62p  to  Mfa37-C^b"Dha,  yeast  cells  were 
labeled  for  5  min  with  ^^S-methionine,  and  the'^rer 
leased  Dha  was  determined  as  described  in  the  legend 
to  Figure  4.  Coexpression  of  Nub-Sec62p  and 
Cub“Dha  yielded  —15  times  more  of  the  free  Dha  than 
coexpression  of  N^b-Boslp  and  Mfa37-  C^b-Dha  (Fig¬ 
ure  4,  C  and  D).  Assuming  that  the  N^b  moieties  in 
Nub“Sec62p  and  N^b-Boslp  were  equally  accessible  to 
the  cytosol  (Figure  4B),  we  concluded  that  the  time- 
averaged  proximity  between  the  nascent  chain  of 


Mfa37-Cub-Dha  and  the  N^b'Bearing  transmembrane 
proteins  was  much  higher  for  Sec62p  than  for  the  ER 
membrane  proteins  that  are  not  involved  in  targeting 
or  translocation.  Note  that  this  analysis  may  actually 
underestimate  the  proximity  of  Sec62p  to  the  nascent 
chain,  because  we  invariably  observed  a  more  efficient 
cleavage  of  MfQ:37-C^b"B^ha  when  Nub-Sec62p  was  the 
only  form  of  Sec62p  in  the  cell  (Figures  4A  and  6A). 
Therefore  we  interpret  the  reduced  cleavage  of  MfQ:37- 
Cub'Dha  in  the  presence  of  both  Nub-Sec62p  'and  the 
native  Sec62p  as  the  consequence  of  competition  be¬ 
tween  those  two  Sec62p-containing  species  for  either 
the  signal  sequences  of  translocated  proteins  or  the 
ligands  of  Sec62p  in  the  complex  of  Sec  proteins. 

Recent  evidence  indicates  that  misfolded  or  other¬ 
wise  abnormal  proteins  in  the  lumen  of  the  ER  can  be 
retrotransported  across  the  ER  membrane  back  into 
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the  cytosol,  where  they  are  degraded  by  the  Ub  system 
(Biederer  et  al,  1996;  Hiller  et  al,  1996;  Wiertz  et  al, 
1996).  This  retrotransport  involves  at  least  some  of  the 
components  of  the  known  ER  translocation  machinery 
(Plemper  et  al,  1997).  To  determine  whether  the  cleav¬ 
age  of  MfQ:37-Cub-Dha  at  the  C^b-Dha  junction  occurs 


Figure  5.  The  nature  of  the  sig¬ 
nal  sequence  and  its  distance 
from  determine  the  extent  of 
cleavage  of  in  the  presence 
of  N^,b-Sec62p.  (A)  S.  cerevisiae  ex¬ 
pressing  MfagT-C^b-Dha  (con¬ 
struct  8;  Figure  2)  (lanes  a  and  b), 
Mfagg-Cujj-Dha  (construct  9) 
(lanes  c  and  d),  Suc223-Q,b-Dha 
(construct  10)  (lanes  e  and  f), 
Suc233-Cub-Dha  (construct  11) 
(lanes  g  and  h),  Suc259-C^b-Dha 
(construct  12)  (lanes  i  and  j)  and 
Suc25^8-Cub-Dha  (construct  13) 
(lanes  k  and  1)  were  labeled  with 
^S-methionine  for  5  min.  The  ex¬ 
tracted  proteins  were  either 
mock-treated  (lanes  a,  c,  e,  g,  i, 
and  k)  or  treated  with  EndoH 
(lanes  b,  d,  f,  h,  j,  and  1),  followed 
by  immunopredpitation  with  an¬ 
ti-ha  antibody  and  SDS-PAGE. 

(B)  Same  as  panel  A  but  the  cells 
also  contained  Nyb-Sec62p  in  ad¬ 
dition  to  the  Cub-fusions  Mfa3^ 
C„b-Dha,  Mfa^s-Qtb'Dha,  Suc223- 
C^-Dha,  Suc233-C,jb*E)ha,  Suc25g- 
C^b-Hha,  SucZsig-Qjb'Dha  (lanes 
a-f).  The  analysis  was  carried  out 
by  immunoblotting  whole  cell 
extracts  with  the  anti-ha  antibody. 

(C)  S.  cerevisiae  cells  expressing 
Suc223-Cub*Dha  (construct  10; 
Figure  2)  (lanes  a-c)  and  SucZsg- 
C^b-Dha  (construct  12;  Figure  2) 
(lanes  d-f)  together  with  either 
Nub*-Sec62p  (lanes  b  and  e), 

Boslp  (lanes  c  and  f)  or  the  vector 
(lanes  a  and  d)  were  labeled  for  5 
min  with  ^^S-methionine.  Whole 
cell  extracts  were  immunopredpi- 
tated  with  anti-ha  antibody,  fol¬ 
lowed  by  SDS-PAGE  and 
autoradiography.  (D)  S.  cerevisiae 
cells  expressing  ASuc223-Cub-Dha 
(construct  15;  Figure  2)  or 
ASuc259-C^-Dha  (construct  16; 
Figure  2)  together  with  either  the 
vector  (first  six  lanes)  or  N^b- 
Sec62p  (last  six  lanes)  were  la¬ 
beled  for  5  min  with  ^S-methie 
nine  and  chased  for  10  and  30 
min,  followed  by  extraction,  im¬ 
munopredpitation  with  anti-ha 
antibody,  and  SDS-PAGE.  Niun- 
bers  15  and  16  indicate  the  posi¬ 
tions  of  the  corresponding  (un- 
deaved)  Qb  fusions. 


during  translocation  into  the  ER  or  during  (in  this 
case)  a  h3^othetical  retrotransport  from  the  ER,  cells 
coexpressing  Nub-Sec62p  and  Mfagy-Cub-Dha  were  la¬ 
beled  for  5  min  with  ^^S-methionine,  and  then  chased 
for  15  min  (Figure  4E).  Although  the  translocated 
MfQ:37-Cub-Dha  disappeared  rapidly  during  the  chase. 
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the  amount  of  free  Dha  that  accumulated  during  the 
pulse  remained  constant. 

We  conclude  that  the  in  vivo  proximity  between 
Sec62p  and  Mfa37-Cub-Dha  that  is  detected  by  the 
split-Ub  assay  occurs  either  during  or  very  shortly 
after  the  synthesis  of  MfQ:37-C^ib-Dha.  The  apparent 
disappearance  of  the  pulse-labeled,  translocated 
Mfa37-Cub-Dha  during  the  chase  accounts  for  the  dif¬ 
ficulty  in  detecting  this  species  by  a  steady-state  assay 
such  as  immunoblotting  (Figures  4 A,  5B,  and  6A).  The 
likely  cause  of  the  disappearance  of  translocated 
Mfa37-Cu^-Dha  is  its  molecular  mass  heterogeneity, 
owing  to  its  glycosylation,  which  results  in  a  smear 
upon  SDS-PAGE  (Figures  3 A,  4C,  and  4E). 

The  Efficiency  of  Ub  Reconstitution  Mediated  by 
1  N^^-Sec62p  Depends  on  Both  the  Identity  of  a  Signal 
Sequence  and  the  Position  of  in  the  Nascent 
^Polypeptide  Chain 

[The  proximity  of  Sec62p  to  The  signal  sequence  of 
^Mfa37-Cub-Dha  is  detected,  in  the  split-Ub  assay, 
through  the  ability  of  Nub-Sec62p  to  induce  the  cleav- 
^age  of  Mf a37-Cub-E)ha  at  the  C^b-Dha  junction  (Figure 
:.3A).  If  this  cleavage  reflects  the  physical  proximity 
ibetween  Sec62p  and  a  signal  sequence,  the  efficiency 
lot  cleavage  should  decrease  if  the  C^b  nioiety  is  moye'd 
jcloser  to  the  C  terminus  of  the  nascent  polypeptide 
"chain.  However,  this  purely  spatial  consideratiori  ne-' 
^  gleets  the  temporal  aspect  of  the  translocation  process 
(Walter  and  Johnson,  1994).  The  targeting  and  the 
actual  translocation  are  initiated  during  or  shortly  af¬ 
ter  the  synthesis  of  a  signal  sequence-bearing  protein. 
Consequently,  the  C-terminal  parts  of  the  nascent 
chain  may  still  be  synthesized,  or  at  least  associated 
^with  the  ribosome,  at  the  time  when  Sec62p  and^  the 
signal  sequence  have  already  become  spatially  close. 
Extending  the  spacer  would  increase  the  distance  be- 
.tween  C^b  tiie  signal  sequence  of  Mfa37-Cub-’E)ha. 
This  would  be  expected  to  decrease  the  time  window 
available  for  the  interaction  between  the  C^b  moiety  of 
Mfa37-Cub-Dha  and  the  moiety  of  Nub-Sec62p, 
^Therefore,  a  test  of  this  kind  cannot  deconvolute  the 
Icohtribution  of  each  of  the  two  parameters  (increased- 
i^patial  distance  along  the  chain  between  Sec62p  and^ 
iCub  arid  decreased  time  window  for  the  N^jb-C^b  iri- 
Jeraction)  to  the  overall  effect  of  extending  the  length 
pf  the  polypeptide  between  the  signal  sequence  and: 
Eub-  These  constraints  notwithstanding,  moving^  the 
yZ^'b  moiety  of  Mfa37-Cub‘Dha  further  away  frorn  its  ' 
signal  sequence  makes  it  possible  to  gauge  the  acces¬ 
sibility  of  Sec62p  to  specific  regions  of  the  nascent 
polypeptide  chain  in  vivo. 

‘I  In  the  actual  experiment,  the  distance  between  the 
signal  sequence  of  Mfa37-Cub-E)ha  and  its  C^b  moiety 
was  increased  from  37  to  65  residues  (Mfa55-Cub"Dha; 
Figure  2B,  construct  9).  The  results  of  EndoH  treat¬ 


ment  of  Mfags-C^b'Dha  immunoprecipitated  from 
pulse-labeled  wild-type  cells  confirmed  that  Mfa^s- 
C^b-Dha  was  efficiently  translocated  into  the  ER  (Fig¬ 
ure  5A).  However,  the  efficiency  of  the  Dha-yielding 
cleavage  of  Mia^^-C^^-Dha  upon  coexpression  of 
Sec62p  was  clearly  reduced  in  comparison  to  the  same 
cleavage  with  Mfa37-Cub‘'I^ha  and  Nub-'Sec62p  (Figure 
5B). 

Both  the  kinetics  and  the  mode  of  targeting  for 
translocation  are  influenced  by  the  identity  of  a  signal 
sequence  (Bird  et  al,  1987;  Johnsson  and  Varshavsky, 
1994b;  Ng  et  al.,  1996).  For  example,  the  efficient  trans¬ 
location  of  invertase  (Suc2p)  requires  the  SRP,  in  con¬ 
trast  to  a  much  weaker  requirement  for  SRP  in  the  case 
of  the  prepro-a-factor's  signal  sequence  (Harm  and 
Walter,  1991;  Ogg  et  al,  1992;  Johnsson  and  Var¬ 
shavsky,  1994b).  Consequently,  the  coupling  between 
translation  and  translocation  is  tighter  for  proteins 
bearing  the  invertase  signal  sequence  than  for  proteins 
carryirig  the  signal  sequence  of  the  a-factor. 

We  assessed  the  in  vivo  proximity  of  the  invertase 
signal  sequence  to  Sec62p  by  measuring  the  reconsti¬ 
tution  of  Ub  from  Nub“Sec62p  and  Suc2-Cub'Dha, 
where  the  Suc2p  moiety  was  linked  to  C^b  through  a 
spacer  of  increasing  length  (Figure  2B).  The  expression 
and  efficient  translocation  of  different  Suc2-Gub-Dha— 
constructs  were  assayed  by  immunoprecipitation  and 
subsequent  EndoH  treatment  (Figure  5A).  The  prox¬ 
imity  of  Cub  Suc2-Cub“Dha  to  N^b  of  Nub“Sec62p 
was  assayed  by  immunoblot  detection  of  the  cleavage- 
derived  free  Dha  in  whole-cell  extracts.  The  pattern 
already  observed  for  the  Mfa-Cub'Dha  constructs  re¬ 
curred  with  the  constructs  bearing  the  invertase  signal 
sequence  (Figure  5B).  Moreover,  coexpression  of  Nub" 
Sec62p  with  either  Suc223-Cub-E)ha  or  Suc233-Cub-Dha 
yielded  lower  amounts  of  free  Dha  than  the  analogous 
assays  with  Nub-Sec62p  and  Mfa37-Cub“Dha,  which 
bears  a  spacer  of  comparable  length  (Figure  5B). 

Pulse-chase  analyses  with  cells  expressing  Nub- 
Sec62p  (or  Nub”Boslp)  and  either  Suc223*'Cub’'Dha  or 
Suc259-Cub-Dha  confirmed  the  immunoblot  data.  Spe¬ 
cifically,  a  significant  release  of  free  Dha  was  observed 
only  for  the  pair  of  Nub“SeG62p  and  Suc223-Cub-Dha 
(Figure  5C).  Our  previous  work  has  shown  that  the 
segment  of  the  nascent  polypeptide  chain  where  the 
Cub  moiety  was  inserted  in  either  the  Suc223-Cub-Dha 
or  the  Mfa37-Cub’Dha  fusion  is  transiently  exposed  to 
the  cytosol — imtil  the  initiation  of  ER  translocation 
(Johnsson  and  Varshavsky,  1994b).  Therefore,  we 
compared  the  ratios  of  cleaved  to  uncleaved  Suc223- 
Cub-E)ha  and  MfQ:37-Cub“Oha.  Cells  expressing  Nub“ 
Sec62p  and  either  the  Cub  fusion  8  or  10  (Figure  2B) 
were  labeled  for  5  min  with  ^^S-methionine  and  pro¬ 
cessed  for  immunoprecipitation  with  anti-ha  antibody, 
followed  by  determination  of  the  cleaved-to-uncleaved 
ratio  (Figure  6D).  This  ratio,  a  measure  of  the  time- 
averaged  proximity  of  Sec62p  to  a  translocating  protein. 
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(black  bars:  b,  d,  f,  and  h)  fusions.  The  sum  of  a  cleaved  and  uncleaved  fusion 
also  indicated. 


was  ~  eightfold  higher  for  a  nascent  polypeptide  bearing 
the  signal  sequence  of  a-factor  than  for  a  nascent 
polypeptide  bearing  the  invertase  signal  sequence  (Fig¬ 
ure  6D;  compare”FigUres^40uri(f5C). 

Spacer  sequences  ofTliffereB#4ength  or  composition 
upstream  of  the  C„b  moietyunight  nonspecifically  in¬ 
fluence  the  interaction  between  Nut,  and  Cub-  To  assess 
this  potential  spacer  effed/we  constructed  signal  se¬ 
quence-lacking  versions  of— Suc2,,-C„;,-Dha  and 
SuC259-Cub-Dha  (Figure  2B,  Cub  fusions  15  and  16),  and 
compared  their  ability  to  reconstitute'  Ub  in  the  pres¬ 
ence  of  coexpressed  Nub-Sec62p.  Both  of  these  Cub 
fusions  were  cleaved  at  the  Cub-Dha  junction  at  approx- 
irnately  the  same  rate  in  the  presence  of  Nub-Sec62p 
(Figure  5D),  in  contrast  to  the  marked  difference  in  the 
rate  of  cleavage  observed  for  their  signal  sequence-bear¬ 
ing  coimterparts  (Figure  5,  B  and  C).  This  control  exper¬ 


Figure  6.  Sec61p,  but  not  a  mu¬ 
tant  of  Sec62p,  are  close  to  the  nas¬ 
cent  chain  of  a  translocated  pro¬ 
tein.  (A)  These  assays  employ^  S. 
cerevisiae  expressing  Mfa37<Z^^,- 
Dha  (construct  8,  Figure  2)  and 
one  of  the  following  fusions 
(Figure  2):  Nub-Sec62p,  either  inte¬ 
grated  (lane  a)  or  plasmid  borne 
(lane  b);  N^-Sec62(AC60)Dha 
(lane  c);  and  N^b-Secblp  (lane  d). 
Whole-cell  extracts  from  these 
strains  were  subjected  to  immuno- 
blot  analysis  with  anti-ha  anti¬ 
body.  (B)  Same  as  panel  A  but  the 
same  fusions  were  coex¬ 
pressed  with  Tpil-C^b-Dha  (con¬ 
struct  14;  Figure  2).  Numbers  2 
and  14  indicate  the  positions  of  the 
corresponding  (uncleaved)  fu¬ 
sions.  (C)  Lane  a:  S.  cerevisiae  ex¬ 
pressing  Suc223-Q,b-Dha  (con¬ 
struct  10;  Figiue  2)  together  jyviih 
either  N^,b-Sec61p7'N^a-Sec61p; 
Nug-Sec61p,  N^,b-Sec62p,  N^- 
Sec62p,  or  Nug-Sec62p;  lane  b: 
same  as  lane  a  but  cells  expressed 
MfQ:37-Cub“Dha  (construct  8;  Fig¬ 
ure  2)  instead  of  Suc2>3-Cyb-Dha. 
(D)  S.  cerevisiae  cells  expressing 
MfQ:37-Cyj,-Dha  together  with 
Nyb-Sec61p  (a  and  b)  or  N^b- 
Sec62p  (e  and  f),  and  cells  express¬ 
ing  Suc223-Cyb-nha  together  with 
Nub-Sec61p  (c  and  d)  or  N^b- 
Sec62p  (g  and  h)  were  labeled  for  5 
min  with  ^^S-methionine.  Whole¬ 
cell  extracts  were  immunoprecipi- 
tated  with  anti-ha  antibody,  fol¬ 
lowed  by  SDS-PAGE,  and 
quantitation  of  the  cleaved  and 
uncleaved  Qb  fusions  using  Phos- 
phorlmager.  Shown  are  the  rela¬ 
tive  amounts  of  the  cleaved  (white 
bars:  a,  c,  e,  and  g)  and  undeaved 
was  set  at  100  in  each  of  the  three  independent  experiments.  SDs  are 


iment  further  emphasized  the  effect  of  distance  between 
a  signal  sequence  and  the  C^b  moiety  on  the  efficiency  of 
Ub  reconstitution  in  the  presence  of  N„b-Sec62p.  We 
conclude  that  the  accessibility  of  Sec62p  in  vivo  to  a 
specific  region  of  the  nascent  polypeptide  chain  is  influ¬ 
enced  by  both  the  nature  of  a  signal  sequence  and  its 
distance  from  that  region. 

SecBlp  Is  EquidUtM  from  Two  Different  Signal 
Sequences 

A  direct  comparison  between  two  different  signal  se¬ 
quences  upstream  of  the  C^b  moiety  presumes  ap¬ 
proximately  equal  residence  times  of  the  correspond- 
irig  ^ub  moieties  in  the  cytosol.  It  is  also  essential  to 
know  that  the  influence  of  the  identity  of  a  signal 
sequence  on  the  rate  of  Ub  reconstitution  is  not  due  to 
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a  nonspecific  intramolecular  interaction.  One  way  to 
address  these  issues  involves  measuring  the  reconsti¬ 
tution  of  Ub  from  the  moieties  of  the  fusions  8  and 
10  (Figure  2B)  and  a  N^b-containing  fusion  that  is  not 
involved  in  translocation.  As  illustrated  in  Figure  4, 
this  test  is  not  feasible  because  of  the  rapid  transloca¬ 
tion  of  C^b'Containing  constructs  into  the  ER.  Note, 
however,  that  since  Sec61p  is  the  central  component  of 
the  translocation  pore,  proteins  that  utilize  different 
targeting  pathways  will  converge  at  Sec61p  shortly 
before  their  translocation  (Jungnickel  and  Rapoport, 

1995) .  Taking  advantage  of  this  property  of  Sec61p,  we 
assayed  the  proximity  of  Mfa37-Cub“F)ha  and  Suc223" 
Cub-Dha  to  Nub"Sec61p.  If  Mfa37-Cub-Dha  and  Suc223- 
C^b'^ha  are  cleaved  at  the  C^b-Dha  junction  equally 
well  in  the  presence  of  Nub-Sec61p,  the  above  inter¬ 
pretation  of  the  observed  selectivity  of  Nub"Sec62p 
toward  Mfa37-Cub''Dba  (Figures  5  and  6)  would  be 
confirmed. 

To  carry  out  this  test,  N^b  fused  to  the  cytosolic 
N  terminus  of  Sec61p  (Figure  2A) '(Wilkinson  ef’ a/., 

1996) .  Nub-Sec61p  is  functionally  active  (Wittk^  and 
Johnsson,  unpublished  data).  It  induced  the  release  of 
free  Dha  from  of  Mfa37-Cub-Dha  and  Tpilp-Cub-Dha 
with  efficiency  similar  to  that  of  N^b‘Sec62p  (Figure  6, 
A  and  B).  Thus,  the  split-Ub  assay  independently  con¬ 
firmed  that  Sec61p  is  close,  to  the  nascent  polypepfide 
chain  during  its  translocation.  To  compare  the  iri  a^vo 
interactions  of  Sec61p  with  the  C^b  fusions  8  aiid‘^10, 
which  bore  different  signal  sequences  (Figure  2B),  the 
amount  of  free  Dha  was  determined  by  immunoblot- 
ting  of  whole-cell  extracts.  It  was  found  that  in  the 
presence  of  Nub“Sec61p,  similar  amounts  of  Dha  were 
released  from  Mfa37-Cub-F)ha  and  Suc223-Cub-Dha, 
whereas  in  the  presence  of  Nub-Sec62p  twice  as  much 
Dha  was  released  from  MfQ:37-Cub"^^^^  than  from 
Suc223-C^,b-Dha  (Figure  6C). 

This  result  was  confirmed  and  extended  by  label¬ 
ing  the  cotransformed  cells  for  5  min  with  ^^S-me- 
thionine  and  quantifying  the  ratio  of  cleaved-to- 
uncleaved  C^b  fusions  (Figure  6D).  As  was  already 
observed  by  the  immunoblot  analysis,  the  above 
ratio  was  ~1  for  both  Mfa37-Cub-Dha  and  Sug223-' 
C^b-^^ha  in  the  presence  ofN^b"Sec61p,  but 
Mfa:37-Cub-Dha,  and  —0.25  for  Suc223-Cub-Dha  in  the 
presence  of  N^b“Sec62p  (Figure  6D).  The  diffeirerice 
revealed  by  the  pulse-immunoprecipitation  analysis 
is  higher  than  the  estimate  obtained  by  the  immu¬ 
noblot  analysis,  most  likely  because  of  the  continu¬ 
ous  accumulation  of  cleaved  (and  long-lived). 'Dha 
before  the  processing  of  cells  for  immunoblotting. 

A  C-terminally  Truncated  Sec62p  Is  No  Longer 
Proximal  to  the  Signal  Sequence 

Does  the  proximity  of  Sec62p  to  a  nascent  polypeptide 
chain  that  is  detected  by  the  split-Ub  assay  reflect  the 

Vol.  10,  February  1999 


physical  binding  of  the  signal  sequence  to  this  pro¬ 
tein?  We  constructed  a  derivative  of  Nub"Sec62p  in 
which  the  C-terminal  60  residues  of  Sec62p  were  re¬ 
placed  by  the  DHFR-ha  (Dha)  moiety,  yielding 
Sec62(AC60)-Dha.  A  similar  Sec62p-invertase  fusion 
was  described’  by  Deshaies  and  Schekman  (1990)  and 
shown  to  be  nonfunctional.  As  expected,  neither 
Sec62(AC60)-Dha  nor  Nub-Sec62(AC60)-Dha  comple¬ 
mented  the  ts  phenotype  of  RSY529  cells  (Figure  3B). 

The  Ub-reconstitution  activity  of  Nub"Sec62(AC60)- 
Dha  in  the  presence  of  either  Mfa:37-Cub-Dha  or  Tpil- 
C^b-Dha  (Figure  6,  A  and  B)  was  compared  with  the 
activity  of  Nub-Sec62p  and  Nub-Sec61p  in  the  presence 
of  the  same  C^b-containing  fusions.  Remarkably,  no 
cleavage  of  Mfa37-Cub"Dha  was  observed  in  the  pres¬ 
ence  of  Nub“Sec62(AC60)-Dha,  whereas  the  cytosolic 
Tpil-Cub“Dha  was  cleaved.  This  result  (Figure  6,  A 
and  B)  indicated  that  the  concentration  and  accessibil¬ 
ity  of  N^b  were  comparable  for  the  functionally  inac¬ 
tive  ]S^ub"Sec62(AC60)-Dha  and  the  functionally  active 
Nub-S^c62p.  In  these  experiments,  Ni^b‘"Sec62(AC60)- 
Dha,  which  could  be  detected  with  the  anti-ha  anti¬ 
body  (Figure  6,  A  and  B),  was  expressed  from  the 
uninduced  Pcupi  promoter.  Strikingly,  even  overex¬ 
pression  of  Nub-Sec62(AC60)-Dha,  from  the  copper- 
induced  P(2upi/  did  not  result  in  a  significant  cleavage 
^  of  Mfa37-Cub''Uha  (our  unpublished  results).  These 
control  experiments  with  Ae  inactive  derivative  of 
Sec62p  indicated  that  the  proximity  signal  in  the 
split-Ub  assay  with  Sec62p  requires  the  functional  ac¬ 
tivity  of  Sec62p. 

Using  UraSp  Reporter  to  Detect  the  In  Vivo 
Proximity  between  Sec62p  and  Signal  Sequences 

The  DHFR-ha  (Dha)  reporter  moiety  of  Mfa37-C^b" 
Dha  was  replaced  by  S.  cerevisiae  UraSp  (orotidine-5'- 
phosphate  decarboxylase),  yielding  Mfa37-Cub-Ura3p. 
The  use  of  cytosolic  UraSp  as  a  reporter  for  transloca¬ 
tion  across  membranes  is  well  documented  (Maarse  et 
al,  1992;  Johnsson  and  Varshavsky,  1994b;  Ng  et  al, 
1996).  The  high  sensitivity  of  UraSp-based  assays  (cells 
become  Ura*^  if  a  threshold  amount  of  UraSp  is 
present  in  the  cytosol)  allowed  us  to  express  the  Nub 
and  C^b  fusions  from  the  uninduced  Pcupi  promoter. 
Since  the  efficient  translocation  of  MfQ:37-C^b"Ura3p 
sequesters  the  UraSp  activity  in  the  ER,  a  wraSA  strain 
of  S.  cerevisiae  that  expressed  Mfa37-Cub"Ura3p  re¬ 
mained  Ura“  (Figure  7A).  Nub“Sec62p,  which,  as 
shown  above,  is  close  to  the  nascent  chain  of  Mfa:37- 
Cub-Dha  during  its  translocation,  induced  enough 
cleavage  of  MfQ:37-Cub-Ura3p  at  the  C^b-UraSp  junc¬ 
tion  to  render  cells  Ura"^  (Figure  7B).  Cells  were  trans¬ 
formed  with  either  Nub-Sec62p,  N^b-Secblp,  N^b’ 
Sec62(AC60)-Dha,  or  N^b-Boslp  to  compare  relative 
proximities  of  these  N^b-containing  proteins  to  C^b 
fusions  bearing  the  UraSp  reporter  moiety  and  either 
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Figure  7.  The  use  of  a  metabolic  marker  to  assess  the  proximity 
between  a  component  of  the  translocation  machinery  and  a  trans¬ 
located  protein.  S.  cereoisiae  expressing  the  fusions  8-12  (Figure 
2)  that  contained  Ura3p  instead  of  Dha  (see  the  main  text)  were 
transformed  with  the  vector  (A)  or  plasmids  expressing  N  h-Sec62p 
(B),  N„b-Sec61p  (C),  N„i,-Sec62(AC60)Dha  (D),  and  N„b-Boslp  (E) 
Cells  were  grown  in  a  Uquid  uracil-containing  SD  medium,  and 
~10  ,  10^  and  10^  cells  were  spotted  onto  uracil-lacking  SD  me¬ 
dium.  Plates  were  examined  after  18  h  at  30°C. 


the  invertase-derived  or  the  a-factor-derived  signal 
sequence  (Figure  7,  B-E).  The  cells  were  spotted  on 
plates  lacking  uracil  and  incubated  at  30°C  for  18  h. 
T|ie  growth  patterns  ohstr-ains^that  expressed  different 
c|mbinations  of  and  Cu{,-containing  fusions  con- 
filmed  the  results  -of-ana'ly.ses'  with  analogous  (but 
rtmre  highly  expressed)  Dha-based  constructs. 

fn  particular,  the  interaction  of  Sec62p  with  the  sig¬ 
nal  sequence  of  prepro-cFfactor  was  stronger  than 
w|th  the  signal  sequence  of  mvertase.  This  proximity 
wSs  not  detectable  whenThl^distahce  between  a  signal 
sequence  and  the  moiety  of  a  fusion  was  increased 
(Figure  7^).  Sec61p  appears  to  be  equally  close  to  both 
of  the  signal  sequences  tested.  Again,  the  proximity 
signal  was  gradually  lost  when  the  distance  between 
the  signal  sequence  and  the  moiety  was  increased 
(Figure  7C).  Cells  acquired  a  weak  Ura+  phenotype  in 
the  presence  of  N^b-Boslp  and  the  C„b-Ura3p  fusions 


8  and  10  (Figure  7E).  If  used  as  a  reference  to  discrim¬ 
inate  between  specific  and  nonspecific  signals  in  this 
assay,  Sec62p,  under  these  conditions,  appears  to  in¬ 
teract  only  with  the  a-factor  signal  sequence.  No  in¬ 
teraction  with  any  of  the  tested  C^b  constructs  was 
detectable  with  the  functionally  inactive  N  h- 
Sec62(AC60)-Dha  (Figure  7D). 

DISCUSSION 

The  new  application  of  the  split-Ub  technique  (Johns- 
son  and  Varshavsky,  1994a,  1997)  described  in  the 
present  work  introduces  a  tool  for  the  analysis  of 
transient  (short-lived)  protein  interactions  in  living 
cells.  A  split-Ub  assay  involves  the  tagging  of  two 
(presumably)  interacting  proteins  with  the  N-  and 
C-terminal  halves  of  Ub,  N^b  and  C^b/  and  monitoring, 
in  a  variety  of  ways,  the  release  of  the  reporter  protein 
fused  to  the  C  terminus  of  C^b-  The  reporter  release, 
through:  the  cleavage  by  Ub-specific  processing  pro¬ 
teases  (tJBPs),  takes  place  in  the  cytosol  if  the  two 
halves  of  Ub  interact  in  vivo  to  form  a  quasi-native  Ub 
moiety  upstream  of  the  reporter  (Figure  1).  Among  the 
advantages  of  this  method  are  its  applicability  either 
in  living  cells  or  in  vitro  and  its  sensitivity  to  kinetic 
aspects  of  a  protein  interaction. 

In  the  present  work,  we  applied  the  split-Ub  tech¬ 
nique  to  the  problem  of  protein  translocation  across 
membranes.  We  showed  that  Sec62p  of  S.  cerevisiae  is 
spatially  close  to  the  signal  sequence  of  the  nascent 
a-factor  polypeptide  in  vivo.  This  proximity  is  con¬ 
fined  to  the  nascent  polypeptide  chain  immediately 
following  the  signal  sequence.  In  addition,  the  extent 
of  proximity  depends  on  the  nature  of  the  signal  se¬ 
quence.  Specifically,  Cyb-containing  test  proteins  that 
bore  the  signal  sequence  of  invertase  resulted  in  a 
much  lower  Ub  reconstitution  with  Nub-Sec62p  than 
the  same  C^b-containmg  proteins  bearing  the  signal 
sequence  of  a-factor.  An  inactive  derivative  of  Sec62p 
failed  to  interact  with  signal  sequences  in  the  split-Ub 
assay.  Taken  together,  these  findings  are  the  first  in 
vivo  evidence  that  S.  cerevisiae  Sec62p,  a  component  of 
the  ER  translocation  machinery,  is  a  part  of  a  signal 
sequence-binding  complex. 

In  Vivo  Proximity  between  Sec62p  and  the  Signal 
Sequence  of  a-Factor 

We  have  previously  shown  that  a  region  of  the  nascent 
polypeptide  chain  that  lies  close  to  the  signal  sequence 
of  invertase  or  the  prepro-a-factor  is  briefly  exposed  to 
the  cytosol  before  its  translocation  into  the  ER  (Johns- 
son  and  Varshavsky,  1994b).  This  feature  of  transloca¬ 
tion  enabled  us,  in  the  present  work,  to  use  the 
split-Ub  assay  for  monitoring  the  proximity  between  a 
secretory  protein  and  components  of  the  translocation 
machinery.  The  C^b  moiety  was  placed  37  residues 
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downstream  from  the  signal  sequence  of  the  a-factor 
precursor,  and  the  N  terminus  of  Sec62p  (see  INTRO¬ 
DUCTION)  was  extended  with  N^^.  Using  this  ver¬ 
sion  of  the  spIit-Ub  assay,  we  could  demonstrate  that 
Sec62p  is  close  to  the  nascent  chain  of  the  a-factor 
during  its  translocation.  By  moving  the  C^b  moiety 
farther  downstream  from  the  signal  sequence  of  the 
a-factor  precursor,  we  obtained  "snapshots"  of  the 
relative  proximity  between  the  nascent  polypeptide 
chain  and  Sec62p  in  vivo.  The  proximity  thus  detected 
was  considerably  reduced  once  the  spacer  sequence 
between  the  signal  sequence  of  the  a-factor  precursor 
and  C^b  was  increased  from  37  to  65  residues  (Figures 
5B  and  7B).  The  data  strongly  suggest  that  the  access 
of  Sec62p  to  the  nascent  chain  of  a-factor  is  confined  to 
a  region  of  the  nascent  chain  that  is  very  close  to  the 
signal  sequence.  Similar  results  were  obtained  with 
the  signal  sequence  of  invertase  as  well.  This  property 
of  Sec62p  is  the  expected  feature  of  a  component  of  a 
signal  sequence  receptor. 

Our  interpretation,  supported  by  several  control  ex-^ 
periihents,  is  in  agreement  with  the  results  produced 
by*  cross-linking  and  binding  studies  in  cell-free  "Sys-  -'' 
terns  (Miisch  et  al.,  1992;  Lyman  and  Schekman,  1997; 
Matlack  et  ah,  1997).  Sec62p  could  be  cross-linked  to 
the  a-factor  precursor  in  vitro,  but  only  when  ATP 
w^  jomitted  and  the  initiation  of  translocation  ‘  t)f  ^ 
a-^ctor  was  halted.  Upon  the  addition  of  ATP,  the  i  ’ 
translocation  resumed  and  cross-linking  waB"'no-: 
longer  possible  (Miisch  et  ah,  1992;  Lyman  and  Schek-  • 
man,  1997).  The  cross-linking  between  Sec62p  and  the 
nascent  polypeptide  chain  was  not  observed  when  the 
translocating  chain  was  halted  in  the  ER  channel 
(Miisch  et  ah,  1992;  Sanders  et  ah,  1992).  It  was  there¬ 
fore  assumed  that  Sec62p  is  not  a  part  of  the  channel 
and  that  it  functions  in  the  early  steps  of  substrate 
recognition  and  initiation  of  translocation.  The 
split-Ub  assay,  in  its  current  form,  depends  on  both 
halves  of  Ub  being  in  the  cell's  cytosol.  Therefore,  the 
absence  of  the  diagnostic  cleavage  (Figures  5B,  5C, 
and  7)  when  the  C^b  nioiety  was  placed  farther  down¬ 
stream  from  the  signal  sequence  (see  RESULTS),  while 
consistent  with  the  absence  of  interactions  between 
Sed62p  and  the  nascent  chain  after  the  initiation  of  its  n. 
tr^slocation,  does  not  address  this  issue  directly.  The  " 
Cul  moiety  that  emerges  from  the  ribosome  after  it  has 
dofked  at  the  ER  channel  is  not  accessible  to  the 
cytosolic  N^b  moiety  even  if  an  Nub"linked  protein  is  ' " 
spatially  close  to  the  translocation  pore.  This  also  ex- 
pl^ns  the  inability  of  N^b'Secblp  to  induce  the  deav-“’' 
agd  of  Cub“Containing  translocation  substrates  bearing 
long  spacer  sequences  between  the  signal  sequence  and 
Cub/  dthough  the  in  vitro  cross-linking  studies  have 
shown  Sec61p  to  be  in  constant  contact  with  the  trans-. 
locating  polypeptide  (Mothes  et  ah,  1994)  (Figure  7). 

The  proximity  between  Sec62p  (or  Sec61p)  and  a 
translocating  polypeptide  is  short  lived.  The  rapid 


transfer  of  the  C^b  moiety  into  the  lumen  of  the  ER 
was  shown  to  either  prevent  or  strongly  inhibit  its 
interaction  with  the  N^b  moiety  of  Nub”Sec62p  and 
Nub”Sec61p  (Figures  3A  and  6C).  In  these  experiments, 
the  Nub  moieties  bore  either  the  glycine  (Nug)  or  the 
alanine  (Nua)  residue  at  position  13  of  Nub-  These 
modifications  decrease  the  affinity  between  the  two 
halves  of  Ub  (see  INTRODUCTION),  thereby  making 
the  reconstitution  of  a  quasi-native  Ub  moiety  more 
dependent  on  the  stability  (half-life)  of  interactions 
between  the  proteins  linked  to  Nub  and  Cub-  These 
assays  clearly  distinguished  the  Sec62p-Sec61p  signal 
sequence  interactions  from  those  that  underlie  the  bet¬ 
ter  understood,  longer-lived  protein  complexes.  For 
example,  when  linked  to  homodimerizing  leucine  zip¬ 
pers,  the  Nua  moiety,  and  even  the  Nu^  moiety,  is 
sufficient  for  reconstitution  of  the  Ub  (Jonnsson  and 
Varshavsky,  1994a), 

•  Is  Sec62p  Part  of  a  Signal  Sequence  Receptor? 

The  split-Ub  assay  measures  the  concentrations  of  the 
^  protein-coupled  Nub  Cub  moieties  in  the  immedi¬ 
ate  vicinity  of  each  other.  Therefore,  a  positive  result 
of  a  split-Ub  assay  signifies  a  spatial  proximity  be- 
;  tween  the  two  proteins  but  cannot,  by  itself,  prove 
.  their  physical  interaction  or  address  the  functional 
;  significance  of  this  proximity.  Nub"Sec62(AC60)Dha  is 
functionally  inactive  and  was  shown  to  be  not  close  to 
the  translocating  Mfa37-Cub-Dha  (Figures  6  and  7). 
Since  the  sequence  between  Nub  the  first  mem¬ 
brane-spanning  region  of  Sec62  was  retained  in  the 
C-terminally  truncated  Sec62(AC60)Dha,  the  distance 
between  Nub  and  the  ER  membrane  was,  most  prob¬ 
ably,  not  altered  relative  to  wild-type  Sec62p,  The  lack 
-  of  significant  cleavage  of  Mfa37-Cub"Dha  in  the  pres¬ 
ence  of  Nub-Sec62(AC60)Dha  must  therefore  result 
from  the  increased  distance  between  Sec62(AC60)p 
and  the  translocating  polypeptide  chain.  The  C-termi- 
nal  domain  of  intact  Sec62p  may  contact  other  com¬ 
ponents  of  the  translocation  complex;  alternatively,  it 
may  contribute  to  a  binding  site  for  the  signal  se¬ 
quence  or  the  nascent  chain.  These  and  related  uncer¬ 
tainties  notwithstanding,  our  results  (Figures  6  and  7) 
L  provide  the  first  in  vivo  evidence  to  support  the  view 
that  Sec62p  is  part  of  a  signal  sequence-binding  com¬ 
plex, 

Sec62p  Discriminates  between  Different  Signal 
Sequences 

The  split-Ub  assay  has  made  it  possible  to  show  that 
Sec62p  discriminates,  in  living  cells,  between  two  dis¬ 
tinct  signal  sequences  in  otherwise  identical  fusion 
proteins.  In  the  presence  of  Nub-Sec62p,  more  of  the 
free  Dha  reporter  protein  was  produced  in  vivo  from 
Mfa37-Cub-Dha  than  from  Suc223-Cub”Dha  (Figures  5 
and  6).  This  selectivity  is  a  property  of  Sec62p  and  not 
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a  feature  of  the  assay  used,  since  approximately  equal 
amounts  of  the  cleaved  reporter  were  produced  with 
different  signal  sequences  if  the  N^b  moiety  was 
present  as  the  Nub-Sec61p  fusion  (Figures  6  and  7). 
This  result  confirmed,  in  vivo,  that  different  targeting 
pathways  converge  at  the  Sec61-containing  complex 
to  initiate  translocation.  The  existence  of  at  least  two 
different  targeting  pathways  to  the  translocation  pore 
was  suggested  by  Walter  and  co-workers  on  the  basis 
of  the  properties  of  yeast  mutants  that  lacked  either 
SRP  or  its  receptor  (Harm  and  Walter,  1991;  Ogg  et  al, 
1992).  One  possibility  is  that  the  targeting  via  SRP 
operates  cotranslationally,  whereas  the  targeting  via 
the  Sec62/63  complex  is  predominantly  posttransla- 
tional.  The  bifurcation  between  the  cotranslational  and 
posttranslational  targeting  is  expected  to  be  stochastic 
for  many  translocated  proteins.  Nonetheless,  certain 
signal  sequences  do  prefer  SRP,  while  some  of  the 
other  signal  sequences  are  targeted  by  the  Sec62/ 
Sec63  complex -(Ng-ef-a/.f4 996).  Genetic  studies  have 
shown  that  the-^anslocation  of  invertase  continues  in 
the  presence  of  mutations  in  either  the  SRP  or  the 
Sec62/63  comple^D^haies  and  Schekman,  1989; 
Ogg  et  al,  1992).  However;  the  kinetics  and  efficiency 
of  invertase  translocation  are  altered  in  the  absence  of 
SRP  (Harm,  and  Walter,  1991;  Johnsson  and  Var¬ 
shavsky,  1994b).  — 

To  explain  the  different  efficiencies  of  cleavage  of  the 
two  signal  sequence-bearing  C^_,b  fusions,  we  propose 
the  following  model.  The  signal  sequence  of  invertase 
is  recognized  primarily  by  SRP  and  then  transferred  to 
the  trimeric  Sec61p  complex,  which  completes  the  pro¬ 
tein's  translocation  across  Ae  ER  membrane.  Under 
conditions  that  result  in  a  shortage  of  SRP  or  a  com¬ 
petition  among  different  signal  sequences,  the  Sec62p/ 
Sec63p-containing  complex,  being  a  part  of  the  alter¬ 
native  targeting  pathway,  would  recognize  an 
increasing  fraction  of  invertase.  This  would  explain 
why  specific  interactions  between  the  invertase  signal 
sequence-bearing  proteins  and  Sec62p  can  only  be 
observed  for  the  more  highly  expressed  Cub“Dha  fu¬ 
sions  (Figure  5C).  By  contrast,  the  targeting  of  proteins 
bearing  the  signal  sequence,.of  the  a-factor  precursor  is 
mediated,- -in  -predominantly  by  the  Sec62p/ 
Sec63p-containmg-complex^,Thislwould  account  for 
the  observed  close‘proximity”of  these  test  proteins  to 
Sec62p  (Figures  5-7)7TPsd^h^split-Ub  technique 
makes  it  possible  to  estimate^he  flux  of  two  different 
secretory  proteins  through_tlxeJ:argeting  pathways  to 
the  ER  membmne  withouMhe-necessity  of  deleting'or 
otherwise  inactivating  specific  components  of  the  tar¬ 
geting  complex. 

Further  Applications  of  the  Split-Ub  Technique 

The  split-Ub  sensor  should  also  be  applicable  to  other 
settings  that  involve  short-lived  protein  interactions 


that  occur  in  the  cytosol  and  are  freely  accessible  to  the 
Ub-specific  proteases.  The’  advantage  of  using  this 
method  for  the  analysis  of  protein  translocation  stems, 
in  part,  from  the  fast  and  irreversible  removal  of  the 
translocated  chain  from  the  location  (cytosol)  where 
the  Hib/Cub  interaction  is  monitored.  Similar  situa¬ 
tions  are  expected  for  the  translocation  of  proteins  into 
other  organelles  such  as  the  mitochondrion,  the  nu¬ 
cleus,  and  the  peroxisome. 

The  demonstration,  in  the  present  work,  that  Ura3p 
can  serve  as  a  reporter  in  a  split-Ub  assay  (Figure  7) 
opens  the  way  to  genetic  screens  based  on  this  assay. 
For  example,  introducing  a  DNA  library  consisting  of 
random  N^^^-gene  fusions  into  a  strain  expressing  a 
signal  sequence-bearing  C^b'Ura3p  protein  should  al¬ 
low  the  identification  of  genes  involved  in  targeting  or 
translocation  by  enabling  the  cells  to  form  colonies  on 
media  lacking  uracil.  This  selection  for  protein  ligands 
that  interact  transiently  in  the  vicinity  of  a  membrane 
complements  the  recent  split-Ub-based  screen  for  li¬ 
gands  that  form  relatively  stable  complexes  (Staeljar  et 
al,  1998). 
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The  N-degrons,  a  set  of  degradation  signals  recognized 
by  the  N-end  rule  pathway,  comprise  a  protein’s  desta¬ 
bilizing  N-terminal  residue  and  an  internal  lysine 
residue.  We  show  that  the  strength  of  an  N-degron  can 
be  markedly  increased,  without  loss  of  specificity, 
through  the  addition  of  lysine  residues.  A  nearly 
exhaustive  screen  was  carried  out  for  N-degrons  in  the 
lysine  (K)-asparagine  (N)  sequence  space  of  the 
14-residue  peptides  containing  either  K  or  N  (16  384 
different  sequences).  Of  these  sequences,  68  were  found 
to  function  as  N-degrons,  and  three  of  them  were  at 
least  as  active  and  specific  as  any  of  the  previously 
known  N-degrons.  All  68  K/N-based  N-degrons  lacked 
the  lysine  at  position  2,  and  all  three  of  the  strongest 
N-degrons  contained  lysines  at  positions  3  and  15.  The 
results  support  a  model  of  the  targeting  mechanism  in 
which  the  binding  of  the  E3-E2  complex  to  the  sub¬ 
strate’s  destabilizing  N-terminal  residue  is  followed  by 
a  stochastic  search  for  a  sterically  suitable  lysine 
residue.  Our  strategy  of  screening  a  small  library  that 
encompasses  the  entire  sequence  space  of  two  amino 
acids  should  be  of  use  in  many  settings,  including 
studies  of  protein  targeting  and  folding. 

Keywords:  N-degron/N-end  rule/proteolysis/simple 
sequences/ubiquitin 


Introduction 

Regulatory  proteins  are  often  short-lived  in  vivo,  providing 
a  way  to  generate  their  spatial  gradients  and  to  rapidly 
adjust  their  concentration  or  subunit  composition  through 
changes  in  the  rate  of  their  synthesis  or  degradation. 
Most  of  the  damaged  or  otherwise  abnormal  proteins  are 
metabolically  unstable  as  well.  Many  other  proteins,  while 
long-lived  as  components  of  larger  structures  such  as 
ribosomes  and  oligomeric  proteins,  are  short-lived  as  free 
subunits  (reviewed  by  Hochstrasser,  1996;  Varshavsky, 
1997;  Hershko  and  Ciechanover,  1998;  Scheffner  et  al, 
1998;  Koepp  et  al,  1999;  Tyers  and  Willems,  1999). 

Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals,  or  degrons  (Laney  and 
Hochstrasser,  1999).  One  class  of  degradation  signals, 
called  the  N-degrons,  comprises  a  protein’s  destabilizing 


N-terminal  residue  and  an  internal  Lys  residue  (Bachmair 
et  al,  1986;  Varshavsky,  1996).  A  set  of  N-degrons 
containing  different  N-terminal  residues  that  are  destabiliz¬ 
ing  in  a  given  cell  defines  a  rule,  termed  the  N-end  rule, 
which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue.  The  lysine  determinant 
of  an  N-degron  is  the  site  of  formation  of  a  substrate- 
linked  multi-ubiquitin  chain  (Bachmair  and  Varshavsky, 
1989;  Chau  et  al,  1989).  The  N-end  rule  pathway  is  thus 
one  pathway  of  the  ubiquitin  (Ub)  system.  Ub  is  a 
76-residue  protein  whose  covalent  conjugation  to  other 
proteins  plays  a  role  in  a  multitude  of  processes,  including 
cell  growth,  division,  differentiation,  and  responses  to 
stress  (Pickart,  1997;  Varshavsky,  1997;  Peters,  1998; 
Scheffner  et  al,  1998).  In  many  of  these  settings,  Ub  acts 
through  routes  that  involve  the  degradation  of  Ub-protein 
conjugates  by  the  26S  proteasome,  an  ATP-dependent 
multisubunit  protease  (Coux  et  al,  1996;  Hilt  and  Wolf, 
1996;  Baumeister  et  al,  1998;  Rechsteiner,  1998). 

The  N-end  rule  is  organized  hierarchically.  In  the 
yeast  Saccharomyces  cerevisiae,  Asn  and  Gin  are  tertiary 
destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion,  by  the  ATZAi-encoded  N-terminal 
amidase,  into  the  secondary  destabilizing  N-terminal  res¬ 
idues  Asp  and  Glu  (Baker  and  Varshavsky,  1995),  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires 
their  conjugation,  by  the  ATEi -encoded  Arg-tRNA-protein 
transferase,  to  Arg,  one  of  the  primary  destabilizing 
residues.  The  primary  N-terminal  residues  are  bound 
directly  by  the  t/RRi-encoded  N-recognin,  the  E3  (recog¬ 
nition)  component  of  the  N-end  rule  pathway.  In  S.  cerevis¬ 
iae,  N-recognin  is  a  225  kDa  protein  that  binds  to  potential 
N-end  rule  substrates  through  their  primary  destabilizing 
N-terminal  residues:  Phe,  Leu,  Tip,  Tyr,  He,  Arg,  Lys  and 
His  (Varshavsky,  1996).  Analogous  components  of  the 
mammalian  N-end  rule  pathway  have  been  identified  as 
well  (Stewart  et  al,  1995;  Grigoryev  et  al,  1996;  Kwon 
etal,  1998,  1999). 

Studies  with  engineered  N-end  rule  substrates  indicated 
the  bipartite  organization  of  N-degrons  and  suggested  a 
stochastic  model  of  their  targeting,  in  which  specific 
lysines  of  an  N-end  rule  substrate  could  be  assigned 
different  probabilities  of  being  used  as  a  ubiquitylation 
site  (Bachmair  and  Varshavsky,  1989;  Chau  et  al,  1989; 
Johnson  et  al,  1990;  Hill  et  al,  1993;  Varshavsky,  1996; 
Levy  et  al ,  1999).  Most  of  the  evidence  for  this  model  was 
produced  with  a  set  of  N-degrons  in  which  a  destabilizing 
N-terminal  residue  X  was  linked  to  the  ~40-residue 
Escherichia  coli  Lac  repressor-derived  sequence  termed 
e^  [extension  (e)  bearing  lysines  (K)]  (Figure  lA)  (Bachm¬ 
air  and  Varshavsky,  1989).  The  resulting  X-e^  sequence 
comprised  a  portable  N-degron,  which  could  confer  short 
half-lives  on  test  proteins  such  as  E.coli  j3-galactosidase 
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Fig.  1,  Test  proteins.  (A)  Fusions  used  in  this  work  contained  some  of 
the  following  elements  (see  Materials  and  methods):  DHFRha,  a 
mouse  dihydrofolate  reductase  moiety  extended  at  the  C-termi’nus  by  a 
sequence  containing  the  hemagglutinin-derived  ha  epitope;  the 
moiety  bearing  the  Lys--»Arg  alteration  at  position  48;  a  40-residue 
E.coli  Lac  repressor-derived  sequence,  termed  e^  and  shown  in  single- 
letter  abbreviations  for  amino  acids;  a  variable  residue  X  (either  Tyr 
His  or  Met)  between  Ub^^g  ^k. 

first  24  residues  of  wild-type  PgaL  The  positions  of  single  or  multiple 
KRK  msertions  into  e^  are  indicated.  The  endogenous  KRK  sequence 
of  e  is  underlined.  The  arrow  indicates  the  site  of  in  vivo  cleavage  by 
DUBs.  (B)  The  fusion  construct  used  for  screening  in  the  K/N 
sequence  space.  The  14  residues  of  e^  immediately  following  the 
residue  X  were  replaced  by  a  set  of  14-residue  sequences  that 
comprised  a  random  permutation  of  Lys  and  Asn  residues,  followed  by 
the  sequence  HGSGAWLLPVSLVRS,  derived  from  residues  2-14  of 
the  e^  extension,  followed  by  Arg-Ser. 


(Pgal)  or  mouse  dihydrofolate  reductase  (DHFR)  (Varshav¬ 
sky,  1996).  At  least  one  of  two  lysines  (K)  in  e*^,  either 
K-15  or  K-17,  must  be  present  for  the  N-degron  to  be 
active  (Figure  lA)  (Bachmair  and  Varshavsky,  1989; 
Johnson  et  al,  1990).  Even  though  several  other  classes 
of  N-degron,  including  the  naturally  occurring  ones,  have 
been  described  over  the  last  decade  (Townsend  et  al, 
1988;  deGroot  et  al,  1991;  Dohmen  et  al,  1994;  Sadis 
and  Finley,  1995;  Ghislain  et  al,  1996;  Sijts  et  al,  1997; 
Tobery  and  Siliciano,  1999),  the  mechanistic  understanding 
of  these  degradation  signals  remains  confined  largely  to  the 
e’^-based  N-degrons  (Varshavsky,  1996;  Levy  era/.,  1999). 

In  the  present  work,  we  show  that  spiking  an  e^-based 
N-degron  with  additional  Lys  residues  can  markedly 
increase  its  activity.  We  also  show,  using  a  new  approach 
of  searching  in  the  sequence  space  of  lysine  and  asparagine, 
that  simple-sequence  N-degrons  can  be  as  strong  and 
specific  as  any  of  the  previously  known  N-degrons.  These 
findings  provide  independent  evidence  for  the  model  of- a 
bipartite  N-degron  and  stochastic  targeting  mechanism 
(Bachmair  and  Varshavsky,  1989).  The  strategy  of  exhaust¬ 
ive  searching  in  the  sequence  space  of  two  amino  acids 
should  be  of  use  in  many  settings,  including  studies  of 
protein  folding  and  degradation. 

Results  and  discussion 

The  Ub/protein/reference  technique 

The  assays  below  utilized  the  previously  developed  Ub/ 
protein/reference  (UPR)  technique,  which  increases  the 


accuracy  of  pulse-chase  analysis  by  providing  a  ‘built-in’ 
reference  protein  (Levy  etal,  1996).  This  method  employs 
-a  linear  fusion  in  which  Ub  is  located  between  a  protein 
of  interest  and  a  reference  protein  moiety  (Figure  lA). 
The  fusion  is  co-translationally  cleaved  by  Ub-specific 
de-ubiquitylating  enzymes  (DUBs)  (Wilkinson  and 
Hochstrasser,  1998)  after  the  last  residue  of  Ub,  producing 
equimolar  amounts  of  the  protein  of  interest  and  the 
reference  protein  bearing  the  C-terminal  Ub  moiety.  If 
both  the  reference  protein  and  the  protein  of  interest  are 
immunoprecipitated  in  a  pulse-chase  assay,  the  relative 
amounts  of  the  protein  of  interest  can  be  normalized 
against  the  reference  protein  in  the  same  sample.  The 
UPR  technique  can  thus  compensate  for  the  scatter  of 
immunoprecipitation  yields,  sample  volumes  and  other 
sources  of  sample-to-sample  variation  (Levy  et  al 
1996,  1999). 

Two  previously  introduced  terms,  ID^,  initial  decay,  i.e. 
the  decay  of  a  protein  during  the  pulse  of  x  min,  and 
ih®  protein  s  half-life  averaged  over  the  interval  of  y 
to  z  min  of  chase  (Levy  et  al,  1996),  are  used  below  to 
describe  the  decay  curves  of  test  proteins.  The  ID"'  term 
and  the  interval-specific  term  tl~l  would  be  superfluous 
in  the  case  of  a  strictly  first-order  decay,  which  is  defined 
by  a  single  half-life.  However,  the  in  vivo  degradation 
of  most  proteins  deviates  from  first-order  kinetics.  For 
example,  the  rate  of  degradation  of  short-lived  proteins 
can  be  much  higher  during  the  pulse,  in  part  because  a 
newly  labeled  (either  nascent  or  just-completed)  polypep¬ 
tide  is  conformationally  immature  and  may,  consequently, 
be  targeted  for  degradation  more  efficiently  than  its  mature 
counterpart.  This  enhanced  early  degradation,  previously 
termed  the  ‘zero-point’  effect  (Baker  and  Varshavsky, 
1991),  is  described  by  the  parameter  ID^  (Levy  et  al, 
1996).  It  was  found  that  a  large  fraction  of  the  zero-point 
effect  results  from  the  co-translational  degradation  of 
nascent  (being  synthesized)  polypeptide  chains,  which 
never  reach  their  mature  size  before  their  destruction 
by  processive  proteolysis  (G.Tumer  and  A.Varshavsky, 
unpublished  data).  The  detection  of  a  zero-point  effect 
requires  the  comparison  of  a  test  protein’s  degradation 
between  cells  containing  and  lacking  the  relevant  proteo¬ 
lytic  pathway.  Alternatively,  the  zero-point  effect  can  be 
detected  by  comparing,  through  the  UPR  technique,  the 
degradation  of  otherwise  identical  degron-containing  and 
degron-lacking  versions  of  a  test  protein  (Levy  et  al, 
1996,  1999).  Although  the  degradation  of  a  protein  during 
the  pulse  can  be  strikingly  high  (Levy  et  al,  1996) 
(see  also  below),  it  is  not  detectable  by  a  conventional, 
reference-lacking  pulse-chase  assay. 

Increasing  the  strength  of  N-degrons  by  spiking 
them  with  additional  lysines 

The  UPR  constructs  of  the  present  work  were  DHFR-ha- 
Ub^'^S-X-e^-Pgal  fusions.  They  contained  the  meta- 
bolically  stable,  ha-epitope-bearing  DHFR-ha-Ub*^'*^ 
moiety  as  a  reference  protein,  termed  dha-Ub  below.  The 
dha-Ub-X-e^-pgal  proteins  were  co-translationally  cleaved 
in  vivo,  yielding  the  test  protein  X-e^-Pgal  and  the 
reference  dha-Ub  (Figure  lA).  To  reduce  the  possibility 
that  the  C-terminal  Ub  moiety  of  dha-Ub  could  function 
as  a  ubiquitylation/degradation  signal,  the  K-48  residue 
of  Ub  (a  major  site  of  isopeptide  bonds  in  multi-Ub 
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chains)  was  converted  to  Arg,  which  cannot  be  ubi¬ 
quity  lated,  yielding  Ub^"^^  (Levy  et  al,  1996).  These  and 
related  fusions  (Figure  lA)  were  expressed  in  Sxerevisiae 
from  low  copy  plasmids  and  the  copper-inducible  Pct/pi 
promoter. 

X-e^-|3gal  is  an  extensively  analyzed  class  of  N-end 
rule  substrates,  which  contain  a  variable  N-terminal  residue 
X  (produced  through  the  DUB-mediated  cleavage  of 
dha-Ub-X-e^-pgal  at  the  Ub-X  junction),  a  40-residue 
N-terminal  extension  called  (see  Introduction),  and  a 
Pgal  moiety  lacking  the  first  24  residues  of  wild-type 
E.coli  Pgal  (Figure  lA).  If  K-15  and  K-17,  the  only 
lysines  of  the  e^  extension  (Figure  lA),  are  replaced  by 
Arg  residues,  which  cannot  be  ubiquity  lated,  the  resulting 
X-e^^-Pgal  is  long-lived  even  if  its  N-terminal  residue  is 
destabilizing  in  the  N-end  rule  (Bachmair  et  al,  1986; 
Johnson  et  al,  1990).  The  inactivity  of  N-degron  in  X- 
eAK-pgai  jg  caused  by  the  absence  of  targetable  lysines 
(Varshavsky,  1996).  Specifically,  the  multiple  lysines  of 
the  Pgal  moiety  in  X-e^^-Pgal  (Chau  et  al,  1989)  cannot 
serve  as  N-degron  determinants,  apparently  because  the 
most  N-terminal  Lys  residue  in  X-e^^-Pgal,  at  position 
239,  is  too  far  from  the  protein’s  N-terminus. 

One  of  our  aims  was  to  produce  stronger  N-degrons. 
We  chose  Tyr,  a  moderately  destabilizing  type  2  residue 
(Bachmair  and  Varshavsky,  1989;  Varshavsky,  1996),  as 
the  N-terminal  residue  of  an  initial  test  protein  (Figure  lA). 
More  strongly  destabilizing  N-terminal  residues,  e.g.  Leu 
or  Arg,  in  the  context  of  (expected)  stronger  N-degrons 
would  have  made  the  test  proteins  too  short-lived  for 
detection  in  a  pulse-chase  assay.  Met  was  employed  as  a 
stabilizing  N-terminal  residue.  The  term  ID^  below  (see 
Materials  and  methods)  conveys  the  extent  of  degradation 
of  a  protein  during  the  5  min  pulse,  in  comparison  with 
the  degradation,  during  the  same  pulse,  of  a  control 
(degron-lacking,  i.e.  Met-bearing)  version  of  the  same 
protein. 

To  determine  whether  the  degradation  of  Tyr-e^-Pgal 
in  Sxerevisiae  could  be  enhanced  through  the  addition  of 
Lys  residues  while  remaining  dependent  on  the  Ubrlp 
N-recognin,  the  sequence  Lys-Arg-Lys  (ICRK),  identic^ 
to  the  sequence  at  positions  15-17  of  e^,  was  inserted 
at  the  indicated  locations  within  e^  (Figure  lA).  The 
unmodified  Tyr-e^-pgal  had  an  ID^  of  -48%,  i.e.  -48% 
of  the  labeled  Tyr-e^-Pgal  was  destroyed  during  the  5  min 
pulse,  before  time  0.  The  t^^^  (half^-life  between  0  and 
10  min  of  chase)  of  Tyr-e^-pgal  was  ~26  min  (Figures  2A 
and  3A).  The  KRK  sequence  inserted  at  any  of  the 
indicated  three  positions  within  e^  (Figure  lA)  strongly 
destabilized  the  already  short-lived  Tyr-e^-Pgal:  for 
example,  Tyr-Kle^-pgal  (Figure  lA)  had  an  ID^  of  -75% 
and  t^^^  of  -5  min  (Figures  2A  and  3A).  The  increased 
degradation  of  Tyr-e^-pgal  derivatives  containing  extra 
KRK  remained  completely  Ubrlp-dependent:  Tyr-e^-Pgal, 
Tyr-Kle^-pgal,  Tyr-K2e*^-Pgal  and  Tyr-K3e^-pgal  were 
all  long-lived  proteins  (?o.5  >10  h)  in  ubrlA  cells 
(Figures  2A  and  3A).  In  addition,  Met-e^-pgal,  Met-Kle^- 
pgal,  Met-K2e^-Pgal  and  Met-K3e^-Pgal,  the  Met-bearing 
counterparts  of  the  Tyr-based  N-end  rule  substrates,  were 
long-lived  in  either  UBRl  or  ubrlA  cells  (data  not  shown). 

These  results  led  us  to  examine  the  effects  of  adding 
more  than  one  KRK  sequence  to  e^  (Figure  lA).  The 
resulting  Tyr-e^-Pgal  derivatives,  bearing  either  two 
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Fig.  2.  Active  N-degrons  can  be  strongly  enhanced  by  additional 
lysines.  (A)  Congenic  UBRl  fwt)  and  ubrlA  S.cerevisiae  that 
expressed  the  UPR-based  fusions  Met-e^-pgal  (Mek)  (DHFR-ha- 
UbR‘*8-Met-e'^-Pgal),  'lyr-e’^-pgal  (Yek),  'lyr-Kle'^-pgal  (YKl), 
'IVr-K2e'^-Pgal  (YK2)  and  Tyr-K3e'^-Pgal  (YK3)  (see  Figure  lA)  were 
labeled  with  [^^S]methionine/cysteine  for  5  min  at  30°C,  followed  by 
a  chase  for  0,  10  and  30  min,  extraction,  immunoprecipitation  with 
anti-ha  and  anti-Pgal  antibodies,  SDS-PAGE,  and  autoradiography  (see 
Materials  and  methods).  The  bands  of  X-pgal  (test  protein)  and 
DHFR-ha-Ub^^^  (reference  protein)  are  indicated  on  the  right.  (B)  As 
in  (A),  but  with  Met-e^-Pgal  (Mek),  Tyr-e^-pgal  (Yek),  Tyr-Kle*^-pgaI 
(YKl),  Tyr-KKle^-Pgal  (YKKl)  and  lyr-KKRle^-Pgal  (YKKKl) 

(see  Figure  lA).  (C)  As  in  (A),  but  with  Met-e^-pgal  (Mek),  His- 
e^-pgal  (Hek),  His-Kle^-Pgal  (HKl),  His-KKle*^-Pgal  (HKKl)  and 
His-KKKle^-Pgal  (HKKKl).  (D)  Metabolic  stability  of 
conformationally  mature  Tyr-e*^-pgal  and  its  KRK-spiked  derivatives. 
JD54  cells  expressing  Tyr-e^-pgal  (Yek),  Tyr-Kle^- 

pgal  (YKl)  or  Tyr-KKle^-pgal  (YKKl)  were  grown  in  SM-raffinose 
medium  (no  expression  of  Ubrlp),  then  labeled  with  [^^SJmethionine/ 
cysteine  for  10  min  at  30°C.  After  a  20  min  chase  in  SM-raffinose, 
galactose  was  added  to  3%  to  induce  Ubrlp  expression,  followed  by  a 
chase  for  1,  3  and  6  h,  and  the  analysis  of  immunoprecipitated  test 
proteins.  Lanes  1,  6  and  11,  the  end  of  labeling  (time  0).  Lanes  2, 

7  and  12,  the  end  of  20  min  chase  in  SM-raffinose.  Lanes  3,  8  and  13, 
1  h  chase  with  galactose.  Lanes  4,  9  and  14,  3  h  chase.  Lanes  5,  10 
and  15,  6  h  chase. 

(Tyr-KKle^-pgal)  or  three  (Tyr-KKKle^-pgal)  KRK 
sequences,  in  addition  to  the  original  KRK  of  e*^,  were 
extremely  short-lived  proteins,  even  though  N-terminal 
Tyr  is  a  weakly  destabilizing  residue  (Varshavsky,  1996). 
For  example,  Tyr-KKle^-jJgal  (Figure  lA)  had  ^§75^®  of 
~4  min  (in  comparison  with  ~26  min  in  the  case  of  Tyr- 
e^-pgal)  and  an  ID^  of  --94%.  In  other  words,  -94%  of 
the  labeled  Tyr-KKle^-Pgal  was  destroyed  during  the 
5  min  pulse,  before  time  0  (Figures  2B  and  3B).  At  the 
same  time,  all  of  these  proteins  were  long-lived  in  ubrIA 
cells  (Figures  2B  and  3B). 

The  N-terminal  lyr  is  bound  by  the  type  2  site  of 
N-recognin  (Ubrlp)  that  recognizes  substrates  bearing 
bulky  hydrophobic  N-terminal  residues  (Varshavsky, 
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Fig.  3.  Quantitation  of  degradation  of  the  test  proteins.  (A)  Pulse- 
chase  patterns  of  Met-e^-pgal  (Me^),  Tyr-e*^-pgal  (Ye^),  lyr-Kle^- 
pgal  (YKl),  Tyr-K2e^^-Pgal  (YK2)  and  TV^-KSe^-pgal  (YK3)  (see 
Figures  lA  and  2 A)  were  quantitated  using  the  UPR  technique  and 
Phosphorlmager  (see  Materials  and  methods).  Time  0  refers  to  the  end 
of  the  5  min  pulse;  100%  refers  to  the  relative  amount  of  Met-e^-pgal, 
normalized  against  the  reference  protein  dha-Ub^^^.  (B)  As  in  (A)  but 
with  Met-e^-pgal  (Me*^),  Tyr-e^-Pgal  (Ye^),  Tyr-Kle^-pgal  (YKl), 
Tyr-KKle^-pgal  (YKKl)  and  Tyr-KKKle^-pgal  (YKKKl)  (see 
Figures  lA  and  2B).  (C)  As  in  (A)  but  with  Met-e^-pgal  (Me^),  His- 
e^-Pgal  (He^),  His-Kle^-pgal  (HKl),  His-KKle^-pgal  (HKKl)  and 
His-KKKle^-Pgal  (HKKKl)  (see  Figures  lA  and  2C).  (D)  As  in  (A), 
but  quantitation  of  the  post-translational  degradation  of  Tyr-e^-Pgal 
(Ye^),  Tyr-Kle^-pgal  (YKl)  and  Tyr-KKle^-pgal  (YKKl)  in  the 
strain  JD54  (see  Figures  lA  and  2A),  following  the 
induction  of  Ubrlp  by  galactose.  Time  0  refers  to  the  end  of  the 
10  min  labeling  in  raffinose  (no  Ubrlp).  The  cells  were  incubated  for 
another  20  min  in  raffinose,  followed  by  the  addition  of  galactose  to 
induce  Ubrlp  (see  Materials  and  methods).  (E)  Relative  enzymatic 
activities  of  pgal  in  UBRl  cells  (filled  bars)  and  cells  (striped 
bars)  expressing  one  of  the  following  test  proteins:  Met-e*^-pgal 
(Me^),  lyr-eK-Pgal  (Ye*^),  Tyr-KleK-3gal  (YKl),  lyr-KKle^-pgal 
(YKKl),  Tyr-KKKlue'^-pgal  (YKKKl),  ffis-e^-Pgal  (He^),  His-Kle^- 
pgal  (HKl),  His-KKle^-pgal  (HKKl)  and  His-KKKBe'^-pgal 
(HKKKl)  (see  Materials  and  methods).  The  activities  of  pgal  were 
normalized  to  the  activity  of  Met-e^-Pgal  in  each  cell.  Values  shown 
are  the  means  from  duplicate  measurements,  which  yielded  results 
within  10%  of  the  mean  values. 


1996).  We  asked  whether  the  above  findings  were  also 
relevant  to  the  type  1  (basic)  destabilizing  N-terminal 
residues  Arg,  Lys  and  His,  which  are  bound  by  the  type  1 
site  of  Ubrlp.  Counterparts  of  the  Tyr-Pgal  fusions  that 
bore  N-terminal  His  (a  weak  type  1  destabilizing  residue) 
were  constructed  (Figure  lA)  and  tested  in  pulse-chase 
assays.  The  His  residue  was  chosen  as  a  type  1  destabilizing 
residue  in  these  tests  for  the  same  reason  as  Tyr  in  the 
preceding  tests:  a  stronger  destabilizing  residue  would 
have  made  the  measurements  impractical  with  extremely 
short-lived  substrates.  The  results  (Figures  2C  and  3C) 
confirmed  the  generality  and  specificity  of  degradation 
enhancement  by  the  additional  KRK  sequences.  For 
example,  His-Kle^-(3gal  had  an  ID^  of  -95%  (i.e.  -95% 
of  the  labeled  His-Kle®^-|3gal  was  destroyed  during  the 
5  min  pulse,  before  time  0),  in  comparison  with  the  ID^ 
of  -41%  for  His-e*^-Pgal;  the  corresponding  foTs'*’  values 
were  -5  min  and  -8  min  for  His-Kle^-Pgal  and  His-e*^- 
pgal,  respectively  (Figure  3C). 

Similar  to  the  results  with  Tyr-bearing  substrates,  their 
His-bearing,  multiple  KRK-containing  counterparts  were 
long-lived  in  ubrlis.  cells  (Figures  2C  and  3C).  The 
only  exception  was  His-KKKle’^-Pgal  (Figure  lA),  which 
contained  three  KRK  sequences,  in  addition  to  the  KRK 
of  the  original  e®^:  in  contrast  to  Tyr-KKKle’^-Pgal,  His- 
KKKle'^-pgal  was  stabilized  strongly  but  incompletely  in 
the  ubrlK  genetic  background  (Figure  3C).  Thus,  the 
His-KKKle^  extension,  in  contrast  to  the  Tyr-KKKle’^ 
extension  (Figure  lA),  appears  to  contain  a  Ubrlp-inde- 
pendent  degron. 

Previous  work  (Madura  et  al,  1993;  Kwon  et  al,  1999) 
has  shown  that  the  steady-state  level  of  an  X-Pgal  protein 
(determined  by  measuring  the  enzymatic  activity  of  Pgal 
in  yeast  extracts)  is  a  sensitive  measure  of  its  metabolic 
stability.  The  results  of  this  steady-state  assay  were  in 
agreement  with  those  derived  from  pulse-chase  measure¬ 
ments:  the  level  of  Tyr-e'^-Pgal  in  UBRl  cells  was  53% 
of  the  level  of  the  long-lived  Met-e’^-pgal,  whereas  Tyr- 
Kle^-Pgal  was  present  at  4%  of  the  Met-e^-Pgal  level, 
and  the  concentration  of  Tyr-KKKle‘^-pgal  was  virtually 
indistinguishable  from  the  assay’s  background  (cells  trans¬ 
formed  with  vector  alone)  (Figure  3E).  Crucially,  the 
levels  of  these  extra  KRK-bearing  'I^r-e’^-Pgal  fusions 
in  ubrlA  cells  became  similar  to  that  of  Met-eKpgal 
(Figure  3E),  in  agreement  with  the  pulse-chase  data 
(Figures  2A,  B  and  3A,  B). 

Although  the  addition  of  extra  KRK  sequences  to 
e^  yielded  considerable  decreases  in  the  of  the 
corresponding  N-end  rule  substrates,  by  far  the  major 
effect  of  multiple  KRK  sequences  was  on  the  decay 
curves’  ID^  term,  which  conveys  the  extent  of  degradation 
of  a  protein  during  or  shortly  after  its  synthesis 
(Figures  2A-C  and  3A-C).  To  examine  this  issue  in  a 
different  way,  Tyr-e^-Pgal,  Tyr-Kle^-pgal  and  Tyr- 
KKleKpgal  (Figure  lA)  were  produced  in  the  JD54 
S.cerevisiae  strain,  which  expressed  Ubrlp  from  the  galac¬ 
tose-inducible,  dextrose-repressible  promoter.  JD54 
cells  expressing  one  of  the  test  proteins  were  labeled  in 
raffinose-containing  SR  medium  (no  Ubrlp),  incubated 
for  20  min  in  the  same  medium  and  thereafter  shifted  to 
galactose,  where  Ubrlp  was  induced.  Even  though  the 
N-end  rule  pathway  became  hyperactive  in  the  presence 
of  galactose  (Madura  and  Varshavsky,  1994;  Ghislain 


6020 


Degradation  signals  in  the  Lys-Asn  sequence  space 


et  al,  1996;  data  not  shown),  the  pre-labeled  substrates 
Tyr-e^-Pgal  and  Tyr-Kle’^-Pgal  were  barely  degraded  after 
the  induction  of  Ubrlp;  Tyr-KKle^-Pgal  was  degraded 
only  slightly  (Figures  2D  and  3D). 

These  findings  were  consistent  with  the  earlier 
evidence  for  a  strong  retardation  of  the  post-translational 
degradation  of  Arg-e’^-Pgal  under  the  same  conditions 
(R.J.Dohmen  and  A. Varshavsky,  unpublished  data).  Thus, 
in  contrast  to  a  newly  formed,  conformationally  immature 
Pgal-based  test  protein,  a  conformationally  mature  Pgal 
tetramer  is  a  poor  substrate  of  the  N-end  rule  pathway 
even  in  the  presence  of  N-degron  enhancements  such 
as  the  additional  KRK  sequences.  It  is  the  pgal  moiety 
of  these  test  proteins  (Figure  lA)  that  was  responsible 
for  the  time-dependent  decline  in  the  rate  of  degradation, 
because  the  kinetics  of  in  vivo  degradation  of  e^-DHFR- 
based  N-end  rule  substrates  was  much  closer  to  first- 
order  decay  (Levy  et  al.,  1999;  data  not  shown). 

Locating  N-degrons  in  the  iysine-asparagine 
sequence  space 

The  earlier  work,  which  led  to  the  bipartite  model  of 
N-degron  (Bachmair  and  Varshavsky,  1989;  Hill  et  al., 
1993),  and  particularly  the  present  findings  about  the 
effects  of  adding  KRK  sequences  to  an  e’^-based  N-degron 
(Figures  2  and  3)  suggested  that  a  substrate’s  destabilizing 
N-terminal  residue  and  a  sterically  suitable  internal  Lys 
residue  (or  residues)  are  the  two  necessary  and  sufficient 
components  of  an  N-degron.  However,  since  both  the 
e'^-based  and  other  previously  analyzed  N-degrons  are 
embedded  in  complex  sequence  contexts  (deGroot  et  al, 
1991;  Dohmen  et  al,  1994;  Varshavsky,  1996),  we  wished 
to  address  the  bipartite-degron  model  by  constmcting  an 
N-degron  from  much  simpler  sequence  motifs.  Should 
this  prove  feasible,  we  also  wanted  to  explore  constraints 
on  the  structure  of  N-degrons  through  a  screen  in  a  simpler 
sequence  setting.  If  the  sequence  space  could  be  reduced 
strongly  enough,  one  advantage  of  such  a  screen  would 
be  its  exhaustiveness.  The  AAA  codon  for  lysine  differs 
by  just  one  third-letter  substitution  from  the  codon  for 
asparagine  (AAU),  a  polar  uncharged  residue.  Thus,  one 
could  attempt  a  screen  for  N-degrons  in  the  sequence 
space  of  two  amino  acids:  Lys  (K)  and  Asn  (N). 

A  double-stranded  oligonucleotide  that  encoded  random 
14-residue  K/N  sequences  (see  Materials  and  methods) 
was  used  to  replace  the  sequence  encoding  14  residues  of 
e^  immediately  following  the  residue  X  (Figure  IB).  In 
the  resulting  test  proteins,  this  latter  sequence,  HGSG- 
AWLLPVSLVRS  (plus  the  sequence  RS),  followed  the 
quasi-random  14-residue  K/N  sequence  (Figure  lA).  The 
resulting  K/N-based  extensions  either  lacked  the  lysines 
or  contained  a  variable  number  of  them  between  residues 
2  and  16.  The  K-17  of  e*^  (the  only  other  lysine  in  e*^) 
was  replaced  by  Arg.  In  these  test  fusions,  dha-Ub’^‘‘*- 
Arg-(K/N)i4-e^-Pgals,  Arg  was  used  as  a  destabilizing 
N-terminal  residue  (Figure  IB).  The  number  of  different 
14-residue  sequences  containing  exclusively  K  or  N  is 
2’^  =  16  384.  The  bulk  of  a  library  of  this  complexity  could 
be  encompassed  with  conventional  screening  methods. 
Testing  of  the  pRKN14-based  library  by  amplifying  it  in 
E.coli  indicated  that  >90%  of  the  plasmids  contained  an 
oligonucleotide  insert.  The  pRKN14  library  was  intro¬ 
duced  into  S.cerevisiae  JD54  (Ghislain  et  al,  1996),  which 


expressed  Ubrlp  from  the  Vqau  promoter,  and  screened 
for  colonies  that  stained  blue  with  XGal  [high  levels  of 
Arg-(K/N)i4-e'^-|igal]  on  dextrose  (SD)  plates  but  stained 
white  [low  levels  of  Arg-(K/N)i4-e^-13gal]  on  replica- 
plated  galactose  (SG)  plates.  Approximately  20  000  colon¬ 
ies  were  screened  this  way.  A  total  of  68  isolates  were 
identified  in  which  the  activity  of  Pgal  was  significantly 
higher  in  the  absence  than  in  the  presence  of  the  N-end 
rule  pathway. 

The  corresponding  Arg-(K/N)i4-eApgal  test  proteins 
were  expressed  in  congenic  ubrllS.  and  UBRl  strains,  and 
the  ratio  of  Pgal  activities  was  determined  for  each  of  the 
test  proteins.  The  results  are  summarized  in  Figure  4, 
which  shows  the  K/N  sequences  of  the  30  most  active 
N-degrons,  and  the  ratios  of  the  corresponding  Pgal 
activity  in  the  ubrIA  strain  to  that  in  the  UBRl  strain 
(higher  ratios  indicate  stronger  N-degrons).  Remarkably, 
the  strongest  K/N-based  N-degron  was  found  to  be  more 
active  than  the  strongest  e^-based  N-degron  (Figure  4). 
Black  bars  in  Figure  4  denote  Pgal  activity  derived  from 
constructs  carrying  fC/N-based  N-degrons  with  lysines 
present  at  positions  3  and  15;  the  strongest  N-degrons 
were  largely  of  this  class  (Figure  4).  K-15  was  present  in 
the  15  strongest  K/N-based  N-degrons  except  one  (clone 
132),  which  had  K  at  position  14,  and  was  also,  presumably 
in  compensation  for  the  absence  of  K-15,  one  of  the  most 
lysine-rich  N-degrons  in  this  set  (Figure  4).  Similarly,  K-3 
was  present  in  the  15  strongest  BC/N-based  N-degrons 
except  three  (clones  3,  77  and  138)  (Figure  4).  All  of 
these  exceptional  clones  bore  K-15;  in  addition,  one  of 
them  (clone  77)  bore  K-4  and  K-5,  as  well  as  K-14  and 
K-15  (Figure  4). 

A  completely  uniform  feature  of  all  68  K/N-based 
N-degrons  was  the  absence  of  K  from  position  2  (Figure  4; 
data  not  shown),  consistent  with  the  fact  that  all  of  the 
previously  examined  N-degrons  (Varshavsky,  1996)  also 
lacked  a  strongly  basic  residue  (Arg  or  Lys)  at  position  2. 
To  address  this  issue  directly,  we  used  site-directed  muta¬ 
genesis,  replacing  asparagine  at  position  2  of  clone  119 
(the  strongest  K/N-based  N-degron;  Figure  4)  with  lysine. 
The  resulting  test  protein  was  long-lived  in  UBRl  cells 
(data  not  shown),  confirming  that  lysine  is  not  tolerated 
at  position  2  of  an  N-degron. 

Even  though  all  of  the  strongest  K/N-based  N-degrons 
contained  at  least  three  lysines,  the  total  number  of  lysines 
in  a  K/N-based  N-degron  was  not  a  strong  predictor  of 
its  activity,  and  the  arrangement  of  additional  lysines 
between  positions  3  and  15  did  not  correlate,  in  an  obvious 
way,  with  the  activity  of  N-degrons  (Figure  4).  None  of 
the  15  strongest  K/N-based  N-degrons  had  more  than  six 
lysines  (most  had  from  three  to  five)  (Figure  4),  indicating 
that  the  density  of  lysines  per  se  is  not  the  main  feature 
of  a  strong  K/N-based  N-degron.  Note  that  our  screen 
rejected  K/N-based  degrons  that  exhibited  a  significant 
Ubrlp-independent  activity.  This  may  in  part  account  for 
the  upper  limit  on  the  number  of  lysines  in  the  strongest 
N-degrons:  more  lysine-rich  K/N  sequences  could  contain 
motifs  recognized  by  non-N-end  rule  pathways,  similarly 
to  the  above  His-KKKl-pgal  test  protein  (Figures  lA,  2C 
and  3C). 

The  in  vivo  degradation  of  Arg-(K/N)i4-e'^-3gal  proteins 
bearing  K/N-based  N-degrons  [produced  from  the  dha- 
UbR48-Arg-(K/N)i4-e^-Pgal  fusions  (Figure  IB)]  was  com- 
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Residue  position  pgal  activity  ratio  (  ubr1Ace\\s/UBR1  ceils) 


Fig.  4.  N-degrons  in  the  K/N  sequence  space.  The  deduced  sequences  of  the  identified  K/N  N-degrons  are  shown  in  conjunction  with  the  bar 
diagram  of  their  relative  activity,  defined  as  the  ratio  of  pgal  activities  in  the  ubrlA  versus  UBRl  cells  expressing  a  given  Arg-*(K/N)i4-e'^-pgal  test 
protein.  The  top  bar  (vertical  stripes)  indicates  the  relative  activity  of  N-degron  in  the  original  Arg-e^-Pgal  (Re*^).  Thirty  14-residue  K/N  extensions 
with  the  highest  Ubrlp-dependent  destabilizing  activity  are  listed,  out  of  the  total  of  68  isolates  that  were  metabolically  unstable  in  the  presence  but 
not  in  the  absence  of  Ubrlp.  The  total  number  of  possible  14-residue  K/N  sequences  is  16  384  (see  the  main  text).  The  Lys  and  Asn  residues  are 
denoted  as  the  letter  K  and  a  hyphen,  respectively.  Position  1  in  each  clone  was  occupied  by  Arg.  The  extensions  in  which  lysines  were  present 
either  at  positions  3  and  15,  or  only  at  15,  or  only  at  3,  are  marked,  respectively,  by  the  filled,  striped  and  open  bars. 


pared  with  the  degradation  of  Arg>e^-Pgal  in  pulse-chase 
assays  (Figure  5A  and  B).  The  chosen  K/N  degrons 
(clones  119, 113  and  4)  were  the  two  most  active  ones  in 
the  Pgal  assay  (clones  119  and  113)  (Figure  4),  and  a 
relatively  weak  one  (clone  4)  (Figure  6A).  As  expected 
from  the  results  of  steady-state  pgal  assays  (Figures  4), 
Arg-(K/N)i4-eApgal  proteins  119  and  113  were  extremely 
short-lived  in  UBRI  cells  (Figure  5A),  in  contrast  to  their 
stability  in  ubrlA  cells  (Figure  5B).  As  with  Arg-e^-Pgal, 
the  bulk  of  degradation  of  Arg-(K/N)i4-e^-Pgals  took 
place  either  during  or  shortly  after  their  synthesis,  so  that 
even  at  time  0  (at  the  end  of  the  5  min  pulse)  the  test 
proteins  could  be  detected  only  by  overexposing  the 
autoradiograms  (Figure  5A;  data  not  shown;  compare  with 
Figure  5B).  In  contrast,  Arg-(K/N)i4-e^-pgal  bearing  the 
weaker  N-degron  of  clone  4  was  readily  detectable  at  time 
0,  and  decayed  more  slowly  afterwards  (Figure  5A),  in 
agreement  with  the  results  of  steady-state  Pgal  assays 
(Figure  6B).  The  distribution  of  three  lysines  in  the 
strongest  N-degron  (clone  119:  Lys3,  8,  15)  (Figure  4) 
was  similar  to  that  in  a  relatively  weak  one  (clone  4: 
Lys4,  12,  14)  (Figure  6),  emphasizing  the  importance  of 
lysines  at  positions  3  and  15. 

To  address  in  more  detail  the  relative  contributions  of 
the  lysines  at  positions  3  and  15  to  the  activity  of  a 
K/N-based  N-degron,  site-directed  mutagenesis  was  used 
to  construct  the  otherwise  identical  N-degrons  that  con¬ 
tained  either  exclusively  Lys3  and  Lys  15  (no  lysine  at  a 
third,  interior  position),  exclusively  Lys3  or  exclusively 
Lysl5  (Figure  6A).  The  resulting  Arg-(K/N)i4-eApgal 


proteins  were  examined  using  both  steady-state 
(Figure  6B)  and  pulse-chase  assays  (Figure  5C  and  D). 
The  K/N-based  N-degron  that  contained  only  Lys3  and 
Lysl5  was  active  but  considerably  less  so  than  the  strongest 
K/N-based  N-degron  of  clone  119,  which  contained  Lys8 
as  well  (Figures  4,  5C  and  6).  The  elimination  of  either 
Lys3  or  Lys  15  further  weakened  the  N-degron,  so  that  the 
Ly$3  and  Lys  15  versions  of  Arg-(K/N)i4-eApgal  were, 
respectively,  slightly  and  moderately  short-lived  proteins 
(Figures  5C,  D  and  7). 

Mechanistic  impiications 

One  finding  of  this  work  is  that  an  N-degron  can  be 
greatly  strengthened  by  spiking  it  with  additional  lysine 
residues.  The  resulting  degradation  signals  remained 
specific:  nearly  all  of  the  enhanced  N-degrons  were 
completely  inactive  in  ubrlA  cells,  which  lacked  the  E3 
(recognition)  component  of  the  N-end  rule  pathway.  In 
addition,  the  enhanced  N-degrons  could  be  inactivated  by 
replacing  their  destabilizing  N-terminal  residue  with  Met, 
a  stabilizing  residue  in  the  N-end  rule. 

The  strength  of  a  K/N-based  N-degron  depends  on  the 
arrangement  of  lysines  in  the  14-residue  N-terminal  region 
of  the  test  protein.  This  dependence  is  both  strong  and 
complex  (Figures  4  and  6).  The  patterns  observed  may 
reflect  distinct  conformational  flexibilities  of  different 
K/N  sequences  vis-a-vis  the  relatively  fixed  spatial  arrange¬ 
ment  of  the  type  1  or  2  sites  of  Ubrlp  (N-recognin)  and 
its  associated  Ubc2p  E2  enzyme  (Madura  et  al,  1993). 
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Fig.  5.  Pulse-chase  analysis  of  K/N-based  N-degrons.  (A)  JD47-13C 
(UBRl)  Sxerevisiae  that  expressed  the  UPR-based  fusions  Met-e^- 
Pgal  (Mek)  (DHFR-ha-Ub'^‘‘®-Met-e‘'-|3gal),  R-e'^-|3gal  (Rek),  clone 
113  of  Aig-(K/N),4-e^-|igal  (KN113),  clone  119  of  Atg-(KVN),4-e'^- 
Pgal  (KN119),  clone  4  of  Aig-(K/N),4-e‘^-Pgal  (KN4)  and  other  clones 
(see  Figures  IB  and  4)  were  labeled  with  P^S]methionine/cysteine  for 
5  min  at  30°C,  followed  by  a  chase  for  0,  10  and  30  min,  extraction, 
immunoprecipitation  with  anti-ha  and  anti-pgal  antibodies,  SDS-PAGE 
and  autoradiography  (see  Materials  and  methods).  The  bands  of 
X-pgal  (test  protein)  and  DHPR-ha-Ub^"^^  (reference  protein)  are 
indicated  on  the  right.  (B)  As  in  (A),  but  with  the  congenic  JD55 
{ubrl^)  cells.  (C)  As  in  (A)  but  with  the  clones  K3,  K15  and  K3K15 
(see  Figure  6).  (D)  As  in  (C)  but  with  JD55  (ubrlA)  cells. 

Some  of  the  conclusions  indicated  by  our  data  are 
described  below. 

(i)  A  completely  uniform  feature  of  all  68  K/N-based 
N-degrons  was  the  absence  of  Lys  from  position  2 
(Figure  4;  data  not  shown),  consistent  with  the  fact  that 
all  of  the  previously  examined  N-degrons  (Varshavsky, 
1996)  also  lacked  a  strongly  basic  residue  (Arg  or  Lys) 
at  position  2.  Using  site-directed  mutagenesis  with  clone 
119  (Figure  4),  we  confirmed  that  lysine  is  not  tolerated 
at  position  2  of  an  N-degron  (data  not  shown). 

(ii)  The  strongest  K/N-based  N-degrons  contained  single 
Lys  residues,  surrounded  by  Asn  residues  (Figure  4),  in 
contrast  to  the  Lys-Arg-Lys  (KRK)  motif  present  in  the 
e^  extension  of  N-degrons  studied  previously  (Figure  lA). 
We  conclude  that  a  lysine-containing  motif  of  three 
adjacent  basic  residues  is  not  an  essential  feature  of  an 
N-degron. 

(iii)  The  presence  of  Lys  15  in  all  of  the  most  active 
K/N-based  N-degrons  (Figure  4)  is  consistent  with  the 
earlier  model  (Bachmair  and  Varshavsky,  1989;  Varshav¬ 
sky,  1996)  in  which  a  targetable  Lys  residue  should 
be  sufficiently  far  along  the  chain  from  a  destabilizing 
N-terminal  residue  to  allow  the  formation  of  a  loop  that 
positions  this  lysine  spatially  close  to  the  N-terminal 
residue.  This  model  is  also  consistent  with  the  finding 
that  Lys3  is  another,  nearly  invariant,  component  of  a 
K/N-based  N-degron.  Specifically,  Lys3  may  be  located 
at  the  uniquely  favorable  ‘linear’  distance  from  the 


p-galactosidase  activity 

Fig.  6.  Site-directed  mutagenesis  of  N-degrons  identified  through  the 
screen  in  the  lysine-asparagine  sequence  space.  (A)  K/N  sequences  of 
clones  119,  113  and  4  and  the  K3/K15  derivatives  of  clone  119  (K3, 
K15  and  K3K15).  The  relative  activity  of  these  N-degrons,  defined  as 
the  ratio  of  Pgal  activities  in  the  ubrIA  versus  UBRl  cells  expressing 
a  given  Arg-(K/N)i4-e^-pgal  protein,  is  indicated  on  the  right.  (B)  The 
levels  of  pgal  activity  in  extracts  from  UBRl  cells  (filled  bars)  and 
ubrlA  cells  (striped  bars)  cells  expressing  the  indicated  test  proteins. 
Standard  deviations  (for  triplicate  measurements)  are  indicated. 


wt  UbrlA 


Fig.  7.  Decay  curves  of  K3  and/or  K15  mutants  of  Arg-(K/N)i4-e^- 
pgal  in  UBRl  and  ubrIA  S.cerevisiae.  Pulse-chase  patterns  of  Met-e^- 
pgal  (Me^),  Arg-e*^-pgal  (Re*^),  and  Lys3  (K3),  Lys  15  (K15)  and 
Lys3/Lysl5  (K3K15)  variants  of  Arg-(K/N)i4-e^-pgaI  in  Figure  5C 
and  D  were  quantitated  as  described  in  the  legend  to  Figure  3  and 
Materials  and  methods. 

N-terminal  residue,  allowing  its  proximity  to  the  destabiliz¬ 
ing  N-terminal  residue  in  the  absence  of  loop  formation. 
That  this  view  is  at  best  incomplete  is  indicated  by  the 
finding  that  either  Lys3  alone  or  Lys  15  alone  cannot 
substitute  for  the  combination  of  Lys3,  Lys8  and  Lys  15 
that  defines  the  strongest  K/N-based  N-degron  identified 
to  date  (clone  119  in  Figures  4  and  6).  Moreover,  even 
Lys3  and  Lys  15  together,  in  the  absence  of  Lys8,  result 
in  a  much  weaker  N-degron  that  the  three-lysine  N-degron 
of  clone  119  (Figure  6),  One  possibility  is  that  Lys  15  is 
the  only  lysine,  among  the  three,  that  can  function  as  the 
site  of  ubiquitylation,  and  that  lysines  at  positions  3  and 
8  are  required  largely  for  optimal  conformational  flexibility 
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of  the  14-residue  K/N  region  in  this  N-degron.  It  is 
unlikely  that  the  much  higher  activity  of  the  clone  119 
(three-lysine)  N-degron  resulted  simply  from  its  higher 
positive  charge,  in  comparison  with  the  two-lysine 
N-degrons,  because  most  of  the  other  K/N  sequences 
containing  three  or  more  lysines  were  much  less  active 
than  the  clone  119  N-degron  (Figure  4). 

We  searched  for  N-degrons  in  a  K/N  region  14  residues 
long  (Figures  IB  and  4),  in  part  because  the  search  with 
a  significantly  longer  region  would  have  precluded 
the  screen  from  being  exhaustive.  Nevertheless,  since  the 
e^  extension  of  the  earlier  N-degrons  is  40  residues  long 
(Figure  lA),  and  since  a  part  of  the  e^  extension  was 
retained,  downstream  of  the  K/N  region,  in  our  test 
proteins  (see  Materials  and  methods),  it  remains  to  be 
determined  whether  a  K/N-based  N-degron  could  be  made 
even  stronger  by  replacing  the  entire  e^  with  a  relevant 
K/N  motif.  For  example,  one  could  take  advantage  of  the 
already  identified  14-residue  K/N  sequence  of  clone  119 
(Figure  4),  and  carry  out  an  analogous  screen  for  N-degrons 
in  which  the  K/N  sequence  of  clone  119  is  fixed,  while 
the  downstream,  e^-derived  sequence  is  replaced  by  ran¬ 
dom  K/N  motifs.  In  sum,  the  K/N  strategy  of  this  work 
is  far  from  exhausted  by  the  present  screen,  and  can  be 
used,  for  example,  to  produce  even  stronger  N-degrons, 
to  search  for  other  K/N-based  degradation  signals  and  to 
probe  the  targeting  mechanisms  of  the  Ub  system. 

Concluding  remarks 

One  implication  of  our  results  is  the  possibility  of  con¬ 
structing  much  stronger  N-degrons.  These  portable 
degradation  signals  can  be  used  to  render  proteins  of 
interest  that  are  short-lived  in  either  conditional  or  uncondi¬ 
tional  settings  (Dohmen  et  al,  1994;  Worley  etal,  1998). 

Another  implication  of  our  results  stems  from  the 
demonstrated  feasibility  of  defining  a  specific  class  of 
degradation  signals  in  a  sequence  space  of  just  two  amino 
acids.  Since  the  N-degrons  have  been  defined  previously 
in  much  more  complex  sequence  contexts  (Varshavsky, 
1996),  our  findings  suggest  that  other  classes  of  degrada¬ 
tion  signal,  and  some  of  the  other  targeting  signals  as 
well,  could  also  be  identified  and  examined  in  this  simple- 
sequence  setting.  Low-complexity  sequences  eliminate 
some  of  the  informational  ‘noise’  of  natural  sequences 
and  thereby  help  to  define  the  major  determinants  of 
structural  specificity.  The  advantages  of  the  simple- 
sequence  approach  have  long  been  recognized  by  research¬ 
ers  who  study  the  fundamentals  of  protein  folding  (Clarke, 
1995).  To  our  knowledge,  the  present  work  is  the  first  to 
extend  the  simple-sequence  approach  to  the  realm  of 
protein  degradation. 

Materials  and  methods 

Strains,  media  and  genetic  techniques 

Saccharomyces  cerevisiae  strains  used  in  this  work  were  JD47-13C 
(AMra  ura3-52  lys2-801  trpl-A63  his3-A200  leu2-3  112)  (Madura 
et  al,  1993),  JD55  (MATa  ura3-52  lys2-801  trpl-A63  his3-A200  l€u2-3 
112  ubrl-Al::HIS3)  (Madura  and  Varshavsky,  1994)  and  JD54  {MATk 
ura3-52  lys2-801  trpl-A63  his3-A200  leu2-3 112  GAU::UBR1)  (Ghislain 
et  al,  1996).  Rich  (YPD)  medium  contained  1%  yeast  extract,  2% 
peptone  (Difco)  and  2%  glucose.  Synthetic  media  (Ausubel  et  al,  1996) 
contained  either  2%  dextrose  (glucose)  (SD  medium),  3%  raffinose  (SR 
medium)  or  3%  galactose  (SG  medium).  To  induce  the  ^cupi  promoter, 


CUSO4  was  added  to  a  final  concentration  of  0.2  mM.  Cells  were 
incubated  and  assayed  at  30°C.  Transformation  of  S, cerevisiae  was 
carried  out  using  the  lithium  acetate  method  (Ausubel  et  ai,  1996). 

Piasmid  construction 

The  plasmids  encoded  Ub  fusions  of  the  UPR  technique  (Levy  et  al, 
1996)]  (see  Results  and  discussion).  The  reference  protein  was  mouse 
DHFR  fused,  through  a  20-residue  spacer  containing  the  ha  epitope,  to 
the  Ub*^^^  moiety  bearing  Arg  instead  of  wild-type  Lys  at  position  48 
(Levy  et  ai,  1996).  The  plasmids  pDhaUbXeKpgal  expressed,  from  the 
^CUPi  promoter,  the  fusions  DHFR-ha-Ub*^^^-X-e^-Pgal,  where  the 
junctional  residue  X  was  either  Met,  Tyr  or  His;  e*^  was  a  previously 
described  40-residue,  E.coli  Lac  repressor-derived  N-terminal  extension 
(Bachmair  and  Varshavsky,  1989;  Johnson  et  al,  1992);  the  Exoli  pgal 
moiety  lacked  the  first  24  residues  of  wild-type  Pgal  (Bachmair  et  al, 
1986).  The  Pgal-coding  fragment  was  produced  by  PCR  amplification 
of  the  Pgal  open  reading  frame  (ORF)  in  pUB23  (Bachmair  et  al,  1986; 
Bachmair  and  Varshavsky,  1989),  using  primers  BGALSl 
(5'-CAGAGATCTCTTAATCGCCTTGCAGCA-3')  and  RSAS2  (5'-C- 
CCTCGAGGTCGACGGTATCG-3'),  and  the  Expand  PCR  System 
(Boehringer,  Indianapolis,  IN).  The  Bglll/Xhol<m  PCR  product  was 
ligated  to  the  insert-lacking,  ?cuPl  promoter-containing  BgHl-Xhol 
fragment  of  pdUbXekUra3,  a  pRS314-based,  low  copy  plasmid  (F.Levy 
and  A.Varshavsky,  unpublished  data).  ORFs  that  expressed  DHFR-ha- 
Ub^'^^-X-e^-pgal  containing  a  modified  region  [single,  double  or 
triple  insertions  of  the  sequence  Lys-Arg-Lys  (KRK)  (Figure  lA)]  were 
produced  using  PCR.  Specifically,  pDhaUbXKiPgal,  pDhaUbXK2pgal. 
pDhaUbXK3pgaI,  pDhaUbXKKiPgal  and  pDhaUbXKKKiPgal  were 
constructed  by  replacing  the  BamUl-Bglll  fragment  encoding  X-e^-Pgal 
with  DNA  fragments  amplified  using,  respectively,  the  sense  primers 
EKKSl  (5'-CACGGATCCAAGAGAAAGGGAGCTTGGCTGTTGC- 
CC-3'),  EKKS2  (5'-CACGGATCCGGAGCTTGGCTGAAGAGAAAG- 
TTGCCCGTCTCACTGGTG-3'),  EKKS3  (5'-CACGGATCCGGAG- 
CTTGGCTGTTGCCCGTCTCAAAGAGAAAGCTGGTGAAAAGAA- 
AAACC-3'),  EKKS4  (5'-CACGGATCCAAGAGAAAGGGAGCTTG- 
GCTGAAGAGAAAGTTGCCCGTCTCACTGGTG-3')  and  EKKKSl 
(5'-CACGGATCCAAGAGAAAGGGAGCTTGGCTGAAGAGAAAG- 
TTGCCCGTCTCAAAGAGAAAGCTGGTGAAAAGAAAAACC-3'). 
The  primer  EKASl  (5'-GGAAGATCTCTGCATTAATGAATC-3')  and 
the  plasmid  pDhaUbMeKpgal  served  as  the  antisense  primer  and  the 
template,  respectively.  The  plasmids  pDhaUbRK3eAPgal,  pDhaUbRKlS- 
eApgal  and  pDhaUbRK3K15eApgal,  which  expressed,  respectively, 
DHFR-ha-Ub^'^^-Arg-Asn-Lys-(Asn)i2-e^-Pgal,  DHFR-ha-Ub'^^^-Arg- 
(Asn)i3-Lys-e^-pgaI  and  DHFR-ha-Ub^^^-Arg-Asn-Lys(Asn)ii-Lys-e^- 
Pgal,  were  constructed  as  follows.  The  term  e^  denotes  the  sequence 
HGSGAWLLPVSLVRS,  a  13-residue  derivative  of  the  40-residue  e^ 
(residues  2-14),  followed  by  two  residues  Arg  and  Ser,  encoded  by  the 
Bglll  site  (Figure  IB).  Double-stranded  oligonucleotides  that  encom¬ 
passed  the  5'  end  of  the  Sacll  site,  the  end  of  Ub-Arg-(Lys/Asn)]4- 
coding  sequences  (see  below),  and  the  3'  end  of  the  BamUl  site  were 
digested  with  and  BamHL  The  resulting  fragments  were  ligated  to 
the  Sacll-BamHL  vector-containing  fragment  of  pDhaUbReAPgal.  The 
latter  plasmid  was  produced  by  inserting  annealed  complementary 
oligonucleotides  EDKSl  and  EDASl,  which  encoded  the  e"^  sequence 
HGSGAWLLPVSLVRS  (see  above),  into  BamHI/Bg/II-cut  pDhaUbReK- 
Pgal.  All  constructs  were  verified  by  nucleotide  sequencing. 

A  screen  for  N-degrons  in  the  iysine^asparagine  sequence 
space 

The-  library  of  pDhaUbReAPgal- derived  plasmids  was  constructed  that 
expressed,  from  the  Pcupi  promoter,  Arg-(K/N)j4-e^-Pgal  proteins  that 
contained  random-sequence  14-residue  Lys/Asn(K/N)-inserts  between 
the  N-terminal  Arg  and  the  moiety  (Figure  IB).  Arg-(K/N)i4-e^-Pgals 
were  the  products  of  co-translational  cleavage  of  DHFR-ha-Ub^^^-Arg- 
(Mi)i4-e'^-Pgals  (Figure  IB).  To  produce  the  library,  a  method  for 
cloning  random-sequence  oligonucleotides  produced  by  mutually  primed 
synthesis  was  used  (Oliphant  et  al,  1986;  Ghislain  et  al,  1996).  A  set 
of  oligonucleotides  5^-CGCCCGCGGTGGTAGGr  A  A  Am^  ^C  ACn. 
GATCCG-3'  that  contained  random  permutations  of  14  codons,  either 
AAA  (encoding  Lys)  or  AAT  (encoding  Asn),  flanked  by  the  SacTl  and 
BamHl  sites  (underlined),  was  synthesized.  The  oligonucleotides  were 
converted  into  their  double-stranded  counterparts  with  Klenow  Pol  I 
(Ausubel  et  al,  1996),  and  were  digested  with  SacII  and  BamHl,  yielding 
a  set  of  equal-length  fragments  containing  randomly  permuted  (AAA/ 
T)i4  inserts.  The  fragments  were  ligated  to  the  Sacll-BamHl  vector- 
containing  fragment  of  pDhaUbReApgal.  The  resulting  library,  in  the 
plasmid  termed  pRKN14,  was  introduced  into  E.coli  DH5aby  electropor- 
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ation  (Ausubel  et  ai,  1996).  Digestion  of  the  pool  of  recovered  plasmids 
with  appropriate  restriction  enzymes  showed  that  >90%  of  transformants 
contained  an  oligonucleotide-derived  insert.  The  pRKN14  library  was 
transformed  into  S.cerevisiae  JD54,  in  which  Ubrlp  was  expressed  from 
the  Pgal/  promoter.  Approximately  2  X  10^  transformants  growing  on 
SD(-Trp)  plates  containing  0.2  mM  CUSO4  were  replica-plated  onto 
CuS04-containing  SG(-Trp)  plates.  The  XGal-based  filter  assay  (Ausubel 
et  al ,  1 996)  was  used  to  screen  for  colonies  that  were  white  (low  levels 
of  Pgal)  on  SG(-Trp)  plates  but  blue  (high  levels  of  pgal)  on  SD(-Trp) 
plates.  These  colonies  were  grown  up  in  liquid  SG(-Trp),  and  the  activity 
of  pgal  was  determined  as  described  below.  Plasmid  DNA  was  isolated 
from  the  positive  transformants,  amplified  in  E.coli  and  transformed 
into  S.cerevisiae  JD47-13C  {UBRl)  and  JD55  {ubrl^).  The  metabolic 
stabilities  of  the  corresponding  Arg-(K/N)i4-e^-pgal  proteins  in  ubrl/S. 
versus  wild-type  {UBRl)  S.cerevisiae  were  determined  by  measuring 
pgal  activity,  and  also  directly,  by  carrying  out  pulse-chase  assays. 

Measurement  of  pgal  activity 

Saccharomyces  cerevisiae  cells  were  added  to  5  ml  of  SD(-Trp) 
containing  0.2  mM  CUSO4,  and  grown  to  A^oo  Cells  were  gently 
pelleted  by  centrifugation,  lysed  with  glass  beads  in  20%  glycerol,  1  mM 
dithiothreitol,  0.1  M  Tris-HCl  pH  8,  and  the  activity  of  pgal  was 
measured  in  the  clarified  extract  using  £?-nitrophenyl-p-D-galactoside,  as 
described  (Ausubel  et  al,  1996).  The  activity  was  normalized  to  the  total 
protein  concentration,  determined  using  the  Bradford  assay  (Bio-Rad, 
Hercules,  CA). 

Pulse-chase  assays 

Transformed  JD47-13C  and  JD55  cells  from  10  ml  cultures  (A^oo  0.5-1) 
in  SD(-Trp)  containing  0.2  mM  CUSO4  were  gently  pelleted  by  centrifu¬ 
gation,  and  washed  in  the  same  medium.  The  cells  were  resuspended  in 
0.3  ml  of  SD(-Trp)  containing  CUSO4  and  labeled  for  5  min  at  30'^C 
with  0.15  mCi  (5.5  MBq)  of  [^^S] methionine/cysteine  (Trans^^S-label, 
ICN,  Costa  Mesa,  CA).  The  cells  were  harvested  by  centrifugation, 
resuspended  in  0.3  ml  of  SD(-Trp),  10  mM  L-methionine,  0.5  mg/ml 
cycloheximide  and  incubated  further  at  30®C.  At  each  time  point,  0.1  ml 
samples  were  withdrawn  and  added  to  0.7  ml  of  the  lysis  buffer  (1% 
Triton  X-100,  0.15  M  NaCl,  1  mM  EDTA,  50  mM  Na-HEPES  pH  7.5) 
containing  1  mM  phenylmethylsulfonyl  fiuoride.  The  cells  were  then 
lysed  by  vortexing  with  0.5  ml  of  0.5  mm  glass  beads  four  times  for 
1  min,  with  intermittent  cooling  on  ice,  followed  by  centrifugation  at 
12  000  g  for  10  min.  The  volumes  of  supernatants  were  adjusted  to 
equalize  the  amounts  of  10%  trichloroacetic  acid-insoluble  followed 
by  immu noprecipitation  with  a  mixture  of  saturating  amounts  of  mono¬ 
clonal  antibodies  against  the  ha  epitope  (Babco,  Berkeley,  CA)  and  Pgal 
(Promega,  Madison,  WI).  The  samples  were  incubated  at  4°C  for  2  h, 
with  rotation,  followed  by  the  addition  of  10  |il  of  protein  A-Sepharose 
suspension  (Repligen,  Cambridge,  MA),  further  incubation  for  1  h,  and 
centrifugation  at  12  000  g  for  30  s.  The  immunoprecipitates  were  washed 
four  times  with  0.8  ml  of  the  lysis  buffer  plus  0.1%  SDS,  resuspended 
in  SDS-sample  buffer  (Ausubel  et  al,  1996),  heated  at  100°C  for  3  min 
and  fractionated  by  SDS-10%  PAGE,  followed  by  autoradiography  and 
quantitation  with  a  Phosphorlmager  (Molecular  Dynamics,  Sunnyvale, 
CA),  using  the  reference  provided  by  the  UPR  technique  (Figures  2 
and  5)  (Levy  et  al,  1996). 

Pulse-chase  assays  with  JD54  cells,  in  which  UBRl  was  expressed 
from  the  ?gau  promoter,  were  carried  out  by  growing  20  ml  cultures 
to  A600  0.5-1  in  SR(-Trp),  collecting  and  resuspending  the  cells  in 
0.5  mi  of  the  same  medium  containing  0.2  mM  CUSO4  and  labeling 
with  0.3  mCi  of  [^^S]methionine/cysteine  for  10  min  at  30°C.  The  cells 
were  washed  twice  with  SR(-Trp),  and  transferred  to  0.5  ml  of  SR 
(-Trp)  containing  10  mM  L-methionine  and  1  mM  cysteine.  After  a 
further  20  min  incubation  at  30°C,  galactose  was  added  (to  a  final 
concentration  of  3%)  to  induce  the  expression  of  UBRl  and  initiate  the 
chase,  whose  time  points  are  indicated  in  Figures  2D  and  3D. 

ID^,  the  extent  of  initial  decay  at  the  end  of  the  5  min  pulse,  was 
calculated  as  follows:  ID^  =  {1  -  [Xpgal]o/[Mete^pgal]o}  X  100%. 
This  parameter  (the  upper  index  refers  to  the  length  of  pulse)  equals 
100%  minus  the  ratio  of  ^^S  in  an  X-pgal  to  ^^S  in  the  reference  protein 
dha-Ub  (DHFR-ha-Ub*^'^^)  at  the  end  of  the  pulse,  normalized  against 
the  same  ratio  with  metabolically  stable  Met-e^-pgal  (Levy  et  al,  1996). 
To  denote  the  observed  half-lives  of  a  test  protein  at  different  regions 
of  a  non-exponential  decay  curve,  a  generalized  half-life  term  t  was 
used,  in  which  0.5  denotes  the  parameter’s  half-life  aspect  and  y-z 
denotes  the  relevant  time  interval,  from  y  to  z  min  of  chase. 
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Abstract  The  N-end  rule  pathway  is  a  ubiquitin-de- 
pendent  proteolytic  system  whose  targets  include 
proteins  bearing  destabilizing  N-terminal  residues.  We 
carried  out  a  synthetic  lethal  screen  for  Saccharomyces 
cerevisiae  mutants  that  require  the  N-end  rule  pathway 
for  cell  viability.  A  mutant  thus  identified,  termed  sln2, 
could  not  grow  in  the  absence  of  Ubrlp,  the  recognition 
component  of  the  N-end  rule  pathway,  which  was  not 
essential  for  viability  of  the  parental  strain  under  the 
same  conditions.  Further  analysis  showed  that  inviabil¬ 
ity  of  sln2  ubrIA  cells  could  be  rescued  either  by  the 
HIS3  gene  (which  was  absent  from  the  parental  strain) 
or  by  a  high  concentration  of  histidine  in  the  medium. 
This  defect  in  histidine  uptake,  exhibited  by  the  sln2 
mutant  in  the  absence  but  not  in  the  presence  of  Ubrlp, 
was  traced  to  the  gene  HIPl,  which  encodes  the  histidine 
transporter.  HIPl  was  underexpressed  in  sln2  ubrlA 
cells,  in  comparison  to  either  sln2  UBRl  or  SLN2  ubrlA 
cells.  Yet  another  property  of  the  sln2  mutant  was  its 
inviability  at  37  °C,  which  could  not  be  rescued  by  either 
UBRl  or  HIS3.  This  feature  of  sln2  allowed  the  cloning 
of  SLN2,  which  was  found  to  be  a  gene  called  CIN8, 
encoding  a  kinesin-like  protein.  Thus,  either  the  N-end 
rule  pathway  or  CinSp  must  be  present  for  the  viability- 
sustaining  rate  of  histidine  import  in  S.  cerevisiae  aux¬ 
otrophic  for  histidine.  We  consider  possible  mechanisms 
of  this  previously  unsuspected  link  between  kinesins, 
ubiquitin-dependent  proteolysis,  and  the  import  of  his¬ 
tidine. 

Key  words  Ubiquitin  •  Proteolysis  ■  Kinesin  •  Histidine 
import  •  CIN8  ■  HIPl  ■  UBRl 
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introduction 

A  number  of  regulatory  circuits  involve  metabolically 
unstable  proteins.  A  short  in  vivo  half-life  of  a  regulator 
provides  a  way  to  generate  its  spatial  gradients  and 
allows  for  rapid  adjustments  of  its  concentration,  or 
subunit  composition,  through  changes  in  the  rate  of 
regulator’s  synthesis  or  degradation.  Short  in  vivo  half- 
lives  are  also  characteristic  of  damaged  or  otherwise 
abnormal  proteins  (Hochstrasser  1996;  Varshavsky 
1997;  Peters  et  al.  1998;  Scheflher  et  al.  1998).  Features 
of  proteins  that  confer  metabolic  instability  are  called 
degradation  signals,  or  degrons.  The  essential  compo¬ 
nent  of  one  degradation  signal,  the  first  to  be  identified, 
is  a  destabilizing  N-terminal  residue  of  a  protein 
(Bachmair  et  al.  1986;  Varshavsky  1996).  The  set  of 
amino-acid  residues  which  are  destabilizing  in  a  given 
cell  yields  a  rule,  called  the  N-end  rule,  which  relates  the 
in  vivo  half-life  of  a  protein  to  the  identity  of  its  N- 
terminal  residue.  Similar  but  distinct  N-end  rule  path¬ 
ways  are  present  in  all  organisms  examined,  from 
mammals  and  plants  to  fungi  and  bacteria  (Bachmair 
and  Varshavsky  1989;  Gonda  et  al.  1989;  Tobias  et  al. 
1991;  Varshavsky  1996;  Worley  et  al.  1998). 

In  eukaryotes,  the  degradation  signals  (N-degrons) 
recognized  by  the  N-end  rule  pathway  comprise  two 
determinants:  a  destabilizing  N-terminal  residue  and  an 
internal  lysine  or  lysines  (Bachmair  and  Varshavsky 
1989;  Hill  et  al.  1993;  DoWen  et  al.  1994).  The  Lys 
residue  is  the  site  of  formation  of  a  multiubiquitin  chain 
(Chau  et  al.  1989).  The  N-end  rule  pathway  is  thus  one 
pathway  of  the  ubiquitin  system.  Ubiquitin  is  a  76-res¬ 
idue  protein  whose  covalent  conjugation  to  other  pro¬ 
teins  plays  a  role  in  a  multitude  of  processes,  including 
cell  growth,  division,  differentiation,  and  responses  to 
stress  (Johnson  et  al.  1995;  Hochstrasser  1996;  Hershko 
1997;  Pickart  1997;  Varshavsky  1997;  Scheffner  et  al. 
1998).  In  most  of  these  processes,  Ub  acts  through 
routes  that  involve  the  degradation  of  ubiquitin-protein 
conjugates  by  the  26S  proteasome,  an  ATP-dependent 
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multisubunit  protease  (Rechsteiner  et  al.  1993;  Coux 
et  al.  1996;  Baumeister  et  al.  1998;  Glickman  et  al. 
1998). 

The  N-end  rule  is  organized  hierarchically.  In  the 
yeast  Saccharomyces  cerevisiae,  Asn  and  Gin  are  tertiary 
destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion  into  the  secondary  destabilizing 
N-terminal  residues  Asp  and  Glu.  This  conversion  is 
mediated  by  the  ATyli-encoded  N-terminal  amidase 
(Nt-amidase)  (Baker  and  Varshavsky  1995).  The  desta¬ 
bilizing  activity  of  N-terminal  Asp  and  Glu  requires 
their  conjugation,  by  the  ATEl-tncoAtA  Arg-tRNA- 
protein  transferase  (R-transferase),  to  Arg,  one  of  the 
primary  destabilizing  residues  (Balzi  et  al.  1990).  The 
primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  f/5R7-encoded  N-recognin  (also  called 
E3),  the  recognition  component  of  the  N-end  rule 
pathway  (Bartel  et  al.  1990).  In  S.  cerevisiae,  N-recognin 
is  a  225  kDa  protein  that  binds  to  potential  N-end  rule 
substrates  through  their  primary  destabilizing  N-termi- 
nal  residues  -  Phe,  Leu,  Trp,  Tyr,  He,  Arg,  Lys,  and  His 
(Varshavsky  1996).  The  [/6r /-encoded  mammalian  N- 
recognin  (]E3a)  has  also  been  characterized  (Hershko 
1991;  Kwon  et  al.  1998).  Ubrlp  has  at  least  two  sub¬ 
strate-binding  sites.  The  type-1  site  is  specific  for  the 
basic  N-terminal  residues  Arg,  Lys  and  His.  The  type-2 
site  is  specific  for  the  bulky  hydrophobic  N-terminal 
residues  Phe,  Leu,  Trp,  Tyr,  and  He  (Varshavsky  1996). 
Both  yeast  and  mammalian  Ubrlp  have  also  been 
shown  to  recognize  a  set  of  internal  degrons  that  remain 
to  be  characterized  (Gonen  et  al.  1991;  Madura  and 
Varshavsky  1994;  Byrd  et  al.  1998). 

The  known  functions  of  the  N-end  rule  pathway  in¬ 
clude  the  control  of  peptide  import  in  S.  cerevisiae, 
through  the  degradation  of  Cup9p,  a  transcriptional 
repressor  of  PTR2  which  encodes  the  peptide  trans¬ 
porter  (Alagramam  et  al.  1995;  Byrd  et  al.  1998);  a  role 
in  regulating  the  Slnlp-dependent  phosphorylation  cas¬ 
cade  that  mediates  osmoregulation  in  S.  cerevisiae  (Ota 
and  Varshavsky  1993);  the  degradation  of  Gpalp,  a  Ga 
protein  of  S.  cerevisiae  (Madura  1994,  #60);  and  the 
conditional  degradation  of  alphaviral  RNA  polymerase 
in  virus-infected  metazoan  cells  (deGroot  et  al.  1991; 
Varshavsky  et  al.  1998).  Physiological  N-end  rule  sub¬ 
strates  were  also  identified  among  the  proteins  secreted 
into  the  mammalian  cell’s  cytosol  by  intracellular  para¬ 
sites  such  as  the  bacterium  Listeria  monocytogenes  (Sijts 
et  al.  1997).  Inhibition  of  the  N-end  rule  pathway  was 
reported  to  interfere  with  mammalian  cell  differentiation 
(Hondermarck  et  al.  1992),  and  to  delay  limb  regener¬ 
ation  in  amphibians  (Taban  et  al.  1996).  In  mammals, 
the  N-end  rule  pathway  is  also  likely  to  play  a  role  in 
catabolic  states  that  result  in  muscle  atrophy  (Solomon 
et  al.  1998  a,b). 

While  the  absence  of  the  N-end  rule  pathway 
(specifically,  the  absence  of  Ubrlp)  from  S.  cerevisiae  is 
lethal  under  conditions  where  the  import  of  peptides  is 
essential  for  cell  survival  (Alagramam  et  al.  1995;  Byrd 
et  al.  1998),  ubrlA  cells  are  viable  and  only  slightly 


growth-retarded  on  standard  media  (Bartel  et  al.  1990). 
Synthetic  lethal  screens  can  often  identify  pathways 
whose  essential  functions  overlap  (or  interact)  with  the 
function  of  a  system  of  interest  (Singer-Kruger  and 
Ferro-Novick  1997;  Ho  et  al.  1998;  Merrill  and  Holm 
1998).  An  earlier  screen  for  S.  cerevisiae  mutants  that 
require  Ubrlp  for  viability  has  identified  Slnlp,  which 
contains  a  sensor  kinase  and  response  regulator  domains 
analogous  to  those  of  the  prokaryotic  two-component 
regulators  (Ota  and  Varshavsky  1993).  Slnlp  functions 
in  the  HOG  signaling  pathway,  which  regulates  the 
response  of  cells  to  changes  in  the  media  osmolarity 
(Posas  et  al.  1996).  slnl  mutants  are  growth-impaired  in 
presence  of  Ubrlp  and  inviable  in  its  absence  (Ota  and 
Varshavsky  1993).  The  identity  of  a  relevant  Ubrlp 
substrate  in  the  HOG  pathway  is  unknown. 

In  the  present  work,  we  carried  out  a  modified  syn¬ 
thetic  lethal  screen  for  S.  cerevisiae  mutants  that  require 
Ubrlp  for  viability.  Our  findings  uncovered  a  previously 
unsuspected  connection  between  kinesins,  the  N-end 
rule  pathway,  and  the  import  of  histidine. 


Materials  and  methods 

Yeast  strains,  media,  and  genetic  procedures.  S.  cerevisiae  strains 
used  in  this  study  are  listed  in  Table  1.  Yeast  were  grown  in  rich 
(YPD)  medium,  or  in  synthetic  media  containing  0.67%  yeast  ni¬ 
trogen  base  without  amino  acids  (Difco),  auxotrophic  nutrients, 
including  adenine,  arginine,  histidine,  leucine,  lysine,  methionine, 
phenylalanine,  threonine,  tryptophan  and  uracil,  and  either  2% 
dextrose  (glucose)  (SD  medium)  or  2%  galactose  (SG  medium) 
(Sherman  et  al.  1986).  In  some  experiments,  the  concentration  of 
histidine  in  SD  medium  was  increased  10-fold,  to  0.2  mg/ml.  Un¬ 
less  stated  otherwise,  cells  were  incubated  at  30  “C.  Yeast  mating, 
sporulation,  and  tetrad  dissection  were  performed  as  described 
(Sherman  et  al.  1986).  The  mating-type  switch  was  carried  out 
through  transformation  with  the  plasmid  pGAL-HO  (Herskowitz 
and  Jensen  1991).  S.  cerevisiae  were  transformed  as  described  by 
Chen  et  al.  (1992).  The  Escherichia  coli  strain  used  was  DHSa 
(Ausubel  et  al.  1996). 

Synthetic  lethal  screen.  The  haploid  Poali-UBRI  strain  JD54,  in 
which  the  UBRl  gene  is  expressed  from  the  Pgali  promoter 
(Ghislain  et  al.  1996),  was  mutagenized  with  ethyl  methane- 
sulfonate  (EMS)  (Sigma,  St.  Louis,  Mo.)  to  approximately  7% 
survival  upon  plating  to  SG  medium  at  30  °C.  The  colonies  on  SG 
plates  were  replica-plated  onto  SD  plates.  The  colonies  that  grew 
on  SG  but  not  on  SD  media  were  re-tested  for  the  absence  of 
growth  on  SD  plates.  The  candidate  isolates  were  transformed  with 
plasmids  that  expressed  UBRl  either  from  its  natural  promoter 
[pUBRl  (Bartel  et  al.  1990)]  or  the  much  stronger  Padhj  promoter, 
and  tested  for  survival  on  SD  plates.  One  mutant,  termed  sln2 
(AVY200),  was  rescued  by  UBRl  on  SD  plates,  and  was  selected 
for  further  study. 

Isolation  of  HIS3  as  a  suppressor  of  sln2.  The  sln2  mutant  was 
transformed  with  a  yeast  genomic  DNA  library  (American  Type 
Culture  Collection,  #77164)  in  the  TRPl,  CEN6-based  vector 
pRS200  (Sikorski  and  Hieter  1989).  Cells  were  plated  on  SD(-Trp) 
plates  and  incubated  at  30  ®C  for  2-3  days.  Out  of  approximately 
4  X  10^  transformants,  25  could  grow  on  SD(-Trp).  Library  plas¬ 
mids  were  rescued  from  these  transformants  and  re-transformed 
into  sln2  cells.  Eight  of  these  plasmids  conferred  viability  on  sln2 
cells  after  re-transformation  and  selection  on  SD(-Trp).  Restriction 
analyses  showed  that  these  plasmids  were  either  identical  or  had 
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Table  1  S.  cerevisiae  strains 
used  in  this  study 


Strain 

Genotype 

References 

JD52 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3.  112 

Johnson  et  al.  (1995) 

JD54 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3.  112  GALL  :  UBRl 

Ghislain  et  al.  (1996) 

AVY199 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3.  112  GAL1::UBR1 

Derivative  of  JD54 

AVY200^ 

MATa  sln2  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  GAL1::UBR1 

Derivative  of  JD54 

AVY201 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  GALL.  UBRl  hiplA-3::URA3 

Derivative  of  JD54 

AVY202 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3.  112  GAL1::UBR1  cm8A-6::URA3 

Derivative  of  JD54 

AVY203 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  cin8A-!0;:URA3 

Derivative  of  JD52 

AVY204 

MATa  sln2  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3.  112  GAL1::UBR1  cup9A-l::LEU2 

Derivative  of  AVY200 

^  Mutant  {sln2)  that  requires  UBRl  for  viability 


large  fragments  in  common.  Truncation  and  sequencing  analyses 
localized  the  complementing  activity  to  a  1.7-kb  BamKl  fragment 
containing  the  HISS  gene.  This  finding  was  independently 
confirmed  by  showing  that  the  HISS  open  reading  frame  (ORF), 
expressed  from  the  Pmet25  promoter  (Mumberg  et  al.  1994),  was 
also  able  to  complement  the  sln2  mutant  on  SD  plates. 

Histidine  uptake  assay.  This  assay  was  carried  out  as  described 
(Grauslund  et  al.  1995),  with  minor  modifications.  Briefly,  the 
strains  to  be  assayed  were  grown  to  A^oo  of  about  0.6  in  SD(-His) 
medium  at  30°C.  Cells  were  harvested  from  5-ml  cultures  and 
resuspended  in  0.4-0.5  ml  of  SD(-His)  at  an  identical  cell  density, 
adjusted  using  a  hemocytometer.  For  each  sample,  0.1  ml  of  sus¬ 
pension  was  added  to  0. 1  ml  of  SD(-His)  medium  containing  8  pM 
of  L-[^'*C]histidine  (Amersham,  Arlington  Heights,  Ill.;  specific 
radioactivity  0.3  Ci/mmol),  and  incubated  at  30°C  for  2  min. 
Samples  were  removed,  filtered  through  a  #30  glass-fiber  filter 
(Schleicher  and  Schuell),  washed  with  10  ml  of  ice-cold  water,  and 
air-dried.  radioactivity  was  determined  using  a  liquid  scintil¬ 
lation  counter.  Since  high  concentrations  of  histidine  in  the  medi¬ 
um  would  have  interfered  with  the  assay,  HISS  was  introduced  (as 
the  pRS313  plasmid)  into  all  hisSA  strains  to  be  tested,  making 
them  prototrophic  for  histidine. 

Northern  hybridization,  S,  cerevisiae  strains  were  grown  at  30°C  to 
Aeoo  of  about  I .  RNA  was  isolated  as  described  (Schmitt  et  al. 
1990)  and  fractionated  by  electrophoresis  in  formaldehyde-con¬ 
taining  agarose  gels  (Ausubel  et  al.  1996).  Briefly,  approximately 
40  pg  of  RNA  per  lane  was  fractionated  in  a  1.2%  agarose  gel 
containing  0.66  M  formaldehyde,  1  x  MOPS  buffer  and  0.1  pg/ml 
of  ethidium  bromide  (Ausubel  et  al.  1996).  Electrophoresis  was 
performed  at  5  V/cm  in  1  x  MOPS  buffer.  Fractionated  RNA  was 
transferred  onto  a  GeneScreen  Plus  membrane  (DuPont  NEN, 
Boston,  Mass.)  according  to  the  manufacturer’s  instructions.  DNA 
probes  were  labeled  with  [a-^^P]dCTP  using  a  DNA  labeling  kit 
(Pharmacia  Biotech,  Piscataway,  N.J.).  Hybridization  was  carried 
out  using  a  NorthemMax  kit  (Ambion,  Austin,  Tex.).  Hybridiza¬ 
tion  patterns  were  detected  by  autoradiography  and  quantitated 
using  Phosphorlmager  (Molecular  Dynamics). 

hiplA  mutant.  This  mutant  was  produced  from  JD54  (Table  1) 
using  a  direct  deletion  technique  described  previously  (Baudin  et  al. 
1993).  Briefly,  two  oligodeoxynucleotides  of  66  nt  were  synthesized, 
attagtacacaaatacagatatacaacctcaacatcaaaaatgtcgaaagctacat- 
ataaggaacg  and  gatatggaaattccgctgtatagctcatctctttcccttcattagtttt- 
gctggcctcatcttctc.  One  oligonucleotide  comprised  40  nt  of  a 
sequence  derived  from  the  5'  untranslated  region  (UTR)  of  the 
HIPl  gene,  followed  by  26  nt  of  a  sequence  derived  from  the  URAS 
ORF,  beginning  from  its  start  codon.  The  second  oligonucleotide 


comprised  40  nt  of  a  sequence  derived  from  the  3'  UTR  of  HIPU 
followed  by  26  nt  of  a  sequence  derived  from  the  3'-proximal 
region  of  the  URAS  ORF,  including  its  stop  codon.  The  two 
oligonucleotides  were  used  as  PCR  primers  to  amplify  URAS  from 
the  pRS306  vector  (Sikorski  and  Hieter  1989).  The  resulting 
targeting  fragment  contained  URAS  flanked  on  either  side  by  40-bp 
sequences  derived  from,  respectively,  the  5'  and  3'  UTRs  of  HIPl, 
JD54  cells  (Table  1)  were  transformed  with  the  targeting  fragment, 
and  selected  for  Ura"^  transformants  at  30°C  on  SD(-Ura)  plates 
containing  0.2  mg/ml  of  histidine.  In  a  typical  experiment, 
approximately  50%  of  Ura"^  transformants  were  the  expected 
hiplA  mutants,  as  verified  by  PCR,  and  also  by  checking  for  the 
phenotype  of  dependence  on  high  concentrations  of  histidine  in  the 
medium. 

Isolation  of  HIPl,  A  gap-repair  method  was  used  to  rescue  HIPl 
from  both  wild-type  (JD54)  and  sln2  (AVY200)  strains  (Rothstein 
1991).  Two  fragments  (the  left  and  right  arms)  flanking  the  gap 
were  amplified  by  PCR  from  either  JD54  or  sln2  genomic  DNA. 
The  left  arm  was  a  693-bp  A7ioI-///«dIII  fragment  corresponding  to 
the  5'  UTR  of  HIPl  from  -976  bp  to  -284  bp.  The  right  arm  was  a 
1044-bp  HinAlWXbal  fragment,  corresponding  to  the  3'  UTR  of 
HIPf  470  bp  downstream  from  the  stop  codon  of  HIPl,  The  left 
and  right  arms  were  inserted  into  the  Xhol  and  Xba\  sites  of 
pRS315,  a  LEU2j  CENS-hascd  vector  (Sikorski  and  Hieter  1989). 
The  resulting  construct  was  linearized  with  HindlUy  and  trans¬ 
formed  into  wild-type  and  sln2  cells  (see  Fig.  3  A).  Plasmids  were 
isolated  from  the  Leu'*'  transformants.  Gap-filled  plasmids 
(pRS315HIPl)  were  identified  by  restriction  analysis,  and  also  by 
sequencing  the  relevant  junctional  regions.  The  primers  for 
producing  the  left-arm  fragment  were  ggggctgaaacacaagaattcataatg 
and  gcacctacttcaactccttagcag.  The  initial  PCR  product  was 
digested  with  Xhol  and  ^Twdlll,  yielding  the  left-arm  DNA  frag¬ 
ment.  The  primers  for  producing  the  right  arm  were  ctactacaga- 
tattacatgtggcg  and  ggtttcggtagaatcggtagattg.  The  initial  PCR 
product  was  digested  with  Hindlll  and  Xbal,  yielding  the  right 
arm.  All  PCR  products  were  verified  by  DNA  sequencing. 

Construction  of  the  sln2  cup9A  mutant.  The  previously  described 
cup9::LEU2  disruption  allele  (Byrd  et  al.  1998)  was  used  to  replace 
the  CUP9  allele  in  sln2  cells  through  homologous  recombination 
(Rothstein  1991),  yielding  the  strain  AVY204  (Table  1).  The  ab¬ 
sence  of  Cup9p  was  verified  by  testing  for  the  ability  of  AVY204 
cells  to  import  dipeptides  in  the  absence  of  UBRl  (on  SD  plates) 
(Byrd  et  al.  1998),  and  also  by  PCR  analysis. 

Isolation  of  the  SLN2  (CIN8)  gene.  The  S.  cerevisiae  genomic 
DNA  library  used  for  the  isolation  of  HISS  (see  above)  was 
transformed  into  the  sln2  strain  (AVY200)  (Table  1).  Cells  were 
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plated  on  SD(-Trp)  plates  containing  histidine  at  0.2  mg/ml,  and 
incubated  at  30®C  for  8  h.  The  temperature  was  then  raised  to 
37°C,  until  the  colonies  formed  (about  3  days).  Out  of  approxi¬ 
mately  6  X  lO'^  transformants  at  30°C,  15  could  grow  on  SD(-Trp, 
+  0.2  mg/ml  His)  at  37®C.  Library  derived  plasmids  were  rescued 
from  these  15  transformants  and  re-transformed  into  sln2  cells, 
with  selection  on  SD(-Trp)  plates  at  37°C.  Three  plasmids 
conferred  viability  on  sln2  cells  after  re-transformation.  Restriction 
mapping  showed  the  inserts  of  these  plasmids  to  be  identical. 
Truncation  and  sequence  analyses  confirmed  that  the  comple¬ 
menting  activity  was  confined  to  the  CIN8  locus. 


rather  than  on  the  nature  of  carbon  source  (dextrose 
versus  galactose),  the  putative  mutants  were  trans¬ 
formed  with  a  low-copy  plasmid  pUBRl  expressing 
UBRl  from  its  natural  promoter,  and  plated  on  SD 
plates.  Only  one  isolate,  termed  sln2  (strain  AVY200; 
Table  1),  formed  colonies  on  SD  plates  in  the  presence 
of  pUBRl  but  not  in  the  presence  of  the  control  vector 
(Fig.  1  A-C).  The  inviability  of  sln2  cells  on  SD  plates 


Isolation  of  the  CIN8*  allele  from  sln2  cells.  A  gap-repair  method 
(Rothstein  1991)  described  above  for  HIPJ  was  used  to  isolate  the 
mutant  allele  of  CIN8  (denoted  as  CIN8*)  from  sln2  cells.  A  4973- 
bp  EcoRl-BamKl  fragment  containing  wild-type  CIN8,  including 
its  5'  (464  bp)  and  3'  (1353  bp)  untranslated  regions  (UTRs),  was 
subcloned  into  the  TR/^y-based  vector  pRS314  (Sikorski  and 
Hieter  1989).  The  resulting  pRS314CIN8  was  gapped  by  digestion 
with  Pad  and  R^^RI,  and  transformed  into  the  sln2  strain 
(AVY200).  Plasmids  were  isolated  from  the  Trp'^  transformants. 
The  gap-filled  plasmids  (pRS314CIN8*)  were  verified  by  restriction 
analysis  and  sequencing  of  the  relevant  junctional  regions. 

Construction  of  null  cin8  mutants.  An  approximately  2.5-kb  Sphh 
Aatll  DNA  fragment  of  pRS314CIN8  was  replaced  by  a  PCR- 
produced,  approximately  1.1 -kb,  URA3-containing  Sphl- Aatll 
fragment  of  YIp5,  yielding  pRS314cin8::URA3.  A  SaH-BamUl 
fragment  of  pRS314cin8::URA3  containing  the  c//j5::URA3  dele¬ 
tion/disruption  allele  was  used  to  replace,  through  homologous 
recombination,  the  CIN8  gene  in  JD52,  a  wild-type  (Ubrl  strain, 
and  in  JD54,  a  Pgali'  AJBRI  strain.  The  resulting  cin8A  mutants 
AVY203  and  AVY202,  respectively  (Table  1),  lacked  codons  169- 
1015  of  the  CIN8  ORF.  The  structure  of  the  cin8A  allele,  and  the 
temperature  sensitivity  of  cin8A  mutants  were  confirmed, 
respectively,  using  PCR  and  ^owth  tests  at  37°C.  The  primers  for 
producing  the  f/R^i-containing  DNA  fragment  were  cattG- 
CATGcgtccgtggaattctcatgtttgac  and  agtgGACGTCtaccaggtgcct- 
gactgcgttagc  (uppercase  letters  denote  the  Sphl  and  Aatll  sites, 
respectively). 


Results 

Isolation  of  the  sln2  mutant 

The  earlier  screen  for  S.  cerevisiae  mutants  requiring  the 
N-end  rule  pathway  (specifically,  Ubrlp)  for  viability 
(Ota  and  Varshavsky  1993)  was  far  from  exhaustive,  and 
in  addition  utilized  wild-type  levels  of  Ubrlp  expression. 
Since  the  activity  of  the  N-end  rule  pathway  is  higher  in 
cells  overproducing  Ubrlp  (Bartel  et  al.  1990;  Madura 
and  Varshavsky  1994),  the  present  screen  was  carried 
out  using  the  strain  JD54  (Ghislain  et  al.  1996),  in  which 
UBRl  was  expressed  from  the  galactose-inducible, 
dextrose-repressible  ^gali  promoter  (Table  1). 
Mutagenized  JD54  cells  were  plated  onto  SG  (galactose- 
containing)  plates,  and  the  colonies  formed  were  replica- 
plated  onto  SD  (dextrose-containing)  plates,  where  the 
N-end  rule  pathway  was  inactive,  owing  to  the  absence 
of  Ubrlp  expression.  Cells  that  yielded  colonies  on  SG 
but  not  on  SD  plates  were  putative  sin  (.synthetic  /ethal 
of  V-end  rule)  mutants.  Four  such  isolates  were  identi¬ 
fied  among  approximately  3.5  x  10^  colonies  screened. 

To  verify  that  the  conditional  viability  of  these 
isolates  was  in  fact  dependent  on  the  presence  of  Ubrlp, 


SG  (37^C) 

Fig.  1  A-D  A  S.  cerevisiae  mutant  whose  viability  requires  the  N-end 
rule  pathway.  (A  and  B)  the  AVY200  (sln2  ?gal‘UBR])  strain  grows 
on  SG  but  not  on  SD  plates  (see  Results).  (C)  evidence  that  the 
viability  of  sln2  mutant  requires  Ubrlp  rather  than  galactose. 
AVY200  {sln2  Pgal'UBR])  cells  were  streaked  on  SD  plates  after 
having  been  transformed  either  with  pUBRl,  a  low-copy  plasmid 
expressing  UBRI  from  its  natural  promoter  or  with  pADH-UBRl, 
another  CRV-based  plasmid  expressing  UBRI  from  the  constitutive 
and  strong  Padnj  promoter,  or  with  vector  alone.  (D)  sln2  cells  are  ts 
for  growth.  The  parental  JD54  (Pgal’UBRI)  strain  and  the  AVY200 
(sln2  Vgal'^RR^  strain  (Table  1)  were  streaked  on  SG  plates  and 
incubated  at  37  ®C 
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could  be  rescued  by  UBRl  expressed  at  either  normal 
(pUBRl)  or  much  higher  levels  (pRB208,  which 
expressed  UBRl  from  the  Badhi  promoter)  (Fig.  1  C). 


Histidine  import  defect  of  sln2  cells 
in  the  absence  of  Ubrlp 

Crossing  AVY200  {sln2)  to  the  strain  AVY199  (congenic 
to  JD54  but  of  the  opposite  mating  type)  (Table  1)  in¬ 
dicated  that  the  sln2  mutation  was  recessive  (data  not 
shown).  We  used  a  library  of  S.  cerevisiae  genomic  DNA 
fragments  in  a  low-copy  vector  to  transform  AVY200 
cells,  screening  for  genes  that  could  rescue  the  inviability 
of  sln2  on  SD  plates  (in  the  absence  of  Ubrlp).  Sur¬ 
prisingly,  HIS3,  a  gene  of  the  histidine  biosynthesis 
pathway,  was  found  to  be  a  strong  suppressor  of  the 
lethality  of  AVY200  {sln2)  on  SD  plates  (Fig.  2  A). 
Since  AVY200  is  a  his3A  strain  (Table  1),  HIS3  is,  by 
definition,  an  extragenic  suppressor  of  the  conditionally 
lethal  sln2  phenotype,  i.e.,  HIS3  is  distinct  from  SLN2. 

The  original  JD54  (his3A)  strain,  being  a  histidine 
auxotroph,  required  the  presence  of  histidine  in  the 
medium.  In  the  presence  of  HIS3,  the  JD54-derived 
AVY200  {sln2)  mutant  would  be  expected  to  become 
prototrophic  for  histidine.  Could  the  effect  of  the  sln2 
mutation  be  an  impairment  of  the  histidine  uptake  in  the 
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Fig.  2  Inviability  of  the  AVY200  (sln2  strain  on  SD 

plates  can  be  rescued  either  by  the  HIS3  gene  (panel  A)  or  by  high 
( >  75  |ig/ml)  concentration  of  histidine  in  the  medium  (panel  B).  See 
Results  for  details 


absence  of  the  N-end  rule  pathway  (specifically,  in  the 
absence  of  Ubrlp)?  This  interpretation  could  account 
for  the  suppression  of  inviability  of  AVY200  cells  on  SD 
plates  by  the  HIS3  gene,  inasmuch  as  the  resulting  strain 
would  be  a  histidine  prototroph.  We  first  asked  whether 
increasing  the  concentration  of  histidine  in  the  growth 
medium  would  rescue  AVY200  {sln2)  cells  on  SD  plates. 
[It  has  previously  been  shown  that  cells  defective  in 
histidine  uptake  could  grow  on  a  medium  containing 
sufficiently  high  concentrations  of  histidine  (Tanaka  and 
Fink  1985).]  Indeed,  AVY200  (sln2  Bgal-UBRI)  cells 
were  found  to  grow  on  SD  plates  containing  histidine  at 
200  |ig/ml,  but  could  not  grow  at  the  standard  histidine 
concentration  of  20  pg/ml  (Fig.  2  B). 

To  examine  the  histidine  import  of  AVY200  {sln2) 
cells  in  a  different  way,  we  compared  the  uptake  of 
L-[*‘*C]histidine  into  slh2  and  other  yeast  strains.  Since 
high  concentrations  of  histidine  in  the  medium  would 
have  interfered  with  the  assay,  we  introduced  the  HIS3 
gene  into  all  four  of  the  his3A  strains  to  be  tested, 
making  them  prototrophic  for  histidine.  As  shown  in 
Fig.  3,  in  the  absence  of  Ubrlp  the  relative  rate  of 
histidine  uptake  in  AVY200  {_sln2  Vgal-UBRI)  cells  was 
about  35®/o  of  the  rate  in  the  parental  JD54  (SLN2 
Fgal-UBRI)  cells.  In  the  presence  of  Ubrlp  (at  its 
wildtype  concentration)  the  rate  of  histidine  uptake  in 
sln2  cells  increased  to  about  75%  of  that  in  SLN2  cells 
(Fig.  3),  thus  accounting,  at  least  in  part,  for  the 
observed  growth  of  sln2  cells  on  SG  plates  (Fig.  1  B). 
The  incomplete  recovery  of  histidine  uptake  in  Ubrlp- 
expressing  AVY200  (sln2)  cells  was  consistent  with  their 


SLAe  sln2  hipU  sln2(UBR1) 

Fig.  3  sln2  cells  are  impaired  in  the  import  of  histidine.  SLN2  is  the 
parental  JD54  strain  (^q^-UBR1)  (Table  1).  hipIA  is  a  /f/P7-lacking 
derivative  of  JD54.  sln2  (pUBRl)  is  the  AVY200  {sln2  ’Pgal-UBRI) 
strain  carrying  pUBRl.  TTie  uptake  of  L-['^C]histidine  was  measured 
as  describe  in  Materials  and  methods.  The  results  are  expressed  as 
the  percent  of  uptake  by  SLN2  cells,  and  represent  the  means  from 
three  independent  experiments.  Standard  deviations  are  shown  above 
the  bars 
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detectably  slower  growth  on  SG  plates  in  comparison  to 
JD54  {SLN2)  cells  (Fig.  1  A,B)-  Note  that  the  decreased 
(about  35%  of  normal)  rate  of  '‘*C-histidine  import  in 
sln2  cells  was  still  considerably  higher  than  the  one  in 
hiplA  cells,  which  lack  the  histidine-specific  transporter 
(Fig.  3).  This  observation  is  in  agreement  with  the  fact 
that  the  minimal  concentration  of  histidine  that  allowed 
the  growth  of  sln2  cells  on  SD  plates  (75  pg/ml)  was 
much  lower  than  the  concentration  of  histidine  (200  pg/ 
ml)  required  for  the  comparable  growth  rate  of  hiplA 
cells  (data  not  shown). 


Down-regulation  of  HIPl  in  the  sln2  ubrlA  mutant 

In  S.  cerevisiae,  the  import  of  histidine  across  the  plasma 
membrane  is  carried  out  largely,  if  not  exclusively,  by 
two  transporters  (permeases),  Hiplp  and  Gaplp  (Tan¬ 
aka  and  Fink  1985;  Jauniaux  and  Grenson  1990).  Gaplp 
is  a  general  amino-acid  transporter  whose  activity  is 
repressed  by  rich  nitrogen  sources  such  as  NH4  in  SD  or 
SG  media  (Jauniaux  and  Grenson  1990;  Hein  et  al. 
1995).  Consequently,  the  histidine-specific  permease 
Hiplp  is  the  main  importer  of  histidine  under  these 
conditions.  The  reduced  uptake  of  histidine  by  AVY200 
(sln2)  cells  in  the  absence  of  Ubrlp  (in  the  absence  of  the 
N-end  rule  pathway)  could  be  caused  by  a  defective 
HIPl  allele  in  sln2  cells  or  by  a  reduced  expression  of  the 
otherwise  wild-type  HIPl. 

To  address  the  first  possibility,  we  used  the  method 
of  gap  repair  (Fig.  4  A)  (see  Materials  and  methods)  to 
isolate  HIPl  from  both  AVY200  {sln2)  and  the  parental 
JD54  {SLN2)  strains.  Complementation  testing  of  HIPl 
from  these  strains  with  hiplA  S.  cerevisiae  showed  that 
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Fig.  4  A-C  Gap  repair-based  cloning  of  the  HIPl  and  CIN8*  alleles 
from  sln2  cells.  Panels  A  and  B  illustrate,  respectively,  the  isolation  of 
HIPl  and  an  allele  of  CIN8  (CIN8*)  from  AVY200  isln2  Pgal’ 
UBRl)  cells.  C  deletion/disruption  of  the  CJN8  gene.  See  Materials 
and  methods 
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Fig.  5  Northern-hybridization  analyses  of  the  S.  cerevisiae  HIPl 
mRNA  in  different  genetic  backgrounds.  (A  and  B)  ^^P-labeled  HIPl 
and  ACTI  (actin)  DNA  probes  (the  latter  appropriately  diluted)  were 
used  to  determine  the  levels  of  the  corresponding  mRNAs  in 
electrophoretically  fractionated  RNA  from  strains  whose  relevant 
genetic  backgrounds  are  indicated  at  the  top  of  panels.  “pUBRl”  and 
“pADH-UBRl”  denote  RNA  from  strains  carrying  low-copy 
plasmids  expressing  UBRl  from  either  the  natural  UBRl  promoter 
or  from  the  Padhi  promoter.  See  also  Materials  and  methods 


the  two  HIPl  alleles  were  indistinguishable  in  their 
ability  to  restore  histidine  import  in  hiplA  cells  (data 
not  shown),  strongly  suggesting  that  sln2  cells  contained 
the  wild-type  version  of  HIPl.  Quantitative  Northern 
hybridization  was  then  used  to  compare  the  levels  of 
HIPl  mRNA  in  sln2  and  congenic  SLN2  cells  in  the 
absence  and  presence  of  Ubrlp  (Fig.  5  and  see  Mate¬ 
rials  and  methods).  The  relative  level  of  HIPl  mRNA 
was  reproducibly  about  2-fold  lower  in  sln2  cells  lack¬ 
ing  Ubrlp  than  in  otherwise  identical  SLN2  cells  or  in 
Ubrlp-containing  SLN2  cells  (Fig.  5  A  and  see  Ma¬ 
terials  and  methods).  This  finding  strongly  suggested 
that  the  inviability  of  Ubrlp-lacking  AVY200  {sln2 
^gal-UBRI)  cells  at  the  standard  concentration  of 
histidine  in  SD  medium  was  caused  by  underexpression 
of  the  Hiplp  transporter.  Consistent  with  this  interpr¬ 
etation,  we  found  that  about  a  2-fold  increase  in  the 
HIPl  copy  number,  through  transformation  with  a 
low-copy  plasmid  expressing  HIPl  from  its  natural 
promoter,  rescued  the  inviability  of  AVY200  cells  on 
SD  plates  (Fig.  6  A).  Note  that  the  HIS3  gene  on  a  low- 
copy  plasmid  rescued  AVY200  {sln2  Poal-UBRI)  cells 
more  effectively  than  did  the  plasmid-bome  extra  copy 
of  HIPl  (Fig.  6  A),  again  in  agreement  with  the  ob¬ 
served  down-regulation  of  HIPl  in  AVY200  cells.  This 
finding  also  accounted,  in  hindsight,  for  the  isolation  of 
H1S3,  but  not  of  HIPl,  in  our  (far  from  exhaustive) 
complementation  screen. 


Ubrlp  increases  the  expression  of  HIPl  in  sln2 
but  not  in  SLN2  cells 

The  effect  of  UBRl  expression  on  the  uptake  of  histidine 
by  Ubrlp-lacking  sln2  cells  (Fig.  3),  and  the  requirement 
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Fig.  6A-D  Effects  of  HIPI  and  CIN8*  on  the  growth  of  strains 
lacking  the  N^end  rule  pathway.  A  AVY200  {sln2  Poal-UBRI)  cells 
(Table  1)  carrying  a  low-copy,  I,£L/2-marked  vector  pRS315  or  the 
otherwise  identical  plasmids  expressing  either  HIS3  or  HIPI  were 
grown  on  SD(-Leu,  +  0.2  mg/ml  of  histidine)  plates  to  allow  colony 
formation  by  all  strains.  Individual  colonies  were  then  streaked  onto 
SD(-Leu)  plates.  Note  the  rescue  of  the  AVY200  {sln2  PoaltUBRI) 
cells  by  HIPI  and  their  even  stronger  rescue  by  HIS3.  B  neither  the 
CIN8*  allele  (isolated  from  stn2  cells)  nor  the  corresponding  TRPl- 
based  vector  pRS314  rescued  inviability  of  AVY200  (s/n2  Pgal- 
UBRI)  cells  on  SD  plates,  in  contrast  to  the  wild-type  CIN8  gene.  C 
cinSA  cells  (AVY202  and  AVY203,  respectively)  derived  from  either 
the  JD54  ^GALrUBRI)  or  JD52  (UBRl)  strains  (Table  1)  were 
streaked  onto  SD  plates.  Note  inviability  of  cinSA  cells  in  the  absence 
of  UBRI  expression.  Cells  were  grown  at  first  on  SD(-Ura,  +  0.2  mg/ 
ml  of  histidine)  plates  to  allow  colony  formation  by  both  strains. 
Individual  colonies  were  then  streaked  onto  SD(-Ura)  plates.  D  same 
as  in  C  but  cin8A  JD54  (cm8A  Pgal-UBRI)  cells  carried  either  the 
control  vector  pRS314  or  the  same  vector  expressing  HIS3.  See 
Materials  and  methods 


of  Ubrlp  for  the  viability  of  these  cells  on  SD  plates 
(Fig.  1  A-C),  could  be  due  to  an  increase  of  HIPI 
expression  in  the  presence  of  Ubrlp.  Comparisons  of 
HIPI  mRNA  levels  among  sln2  strains  that  either  lacked 
Ubrlp  or  produced  increasing  amounts  of  Ubrlp 
showed  that  the  expression  of  HIPI  was  indeed  in¬ 
creased  in  the  presence  of  Ubrlp  in  sln2  cells,  but  not  to 
its  wild-type  level  (Fig.  5).  As  quantified  with  Phos- 
phorlmager  and  internal  reference,  the  increase  caused 
by  the  presence  of  Ubrlp  was,  reproducibly,  about  1.5- 
fold,  in  comparison  to  the  approximately  2-fold  increase 
that  would  have  been  required  for  restoration  of  the 
wild-type  level  of  HIPI  mRNA  (Fig.  5  and  data  not 
shown).  The  partial  restoration  of  HIPI  expression  by 
Ubrlp  in  sln2  cells  is  consistent  with  the  observed  partial 
recovery  of  histidine  uptake  (Fig.  3),  and  with  lower 
than  wild-type  growth  fates  of  Ubrlp-supplemented  sln2 
cells  (Fig.  1  B). 

We  also  asked  whether  the  presence  of  Ubrlp  could 
increase  the  expression  of  HIPI  in  SLN2  cells.  As 
shown  in  Fig.  5  A,  the  expression  of  Ubrlp,  at  in¬ 
creasing  levels,  in  SLN2  ubrIA  cells  did  not  influence 
the  expression  of  HIPI.  Taken  together,  our  results 
indicated  that  at  least  two  distinct  pathways  influence 
the  expression  of  HIPI:  a  5'LAf2-dependent  pathway 
and  the  £75/?7-dependent  N-end  rule  pathway.  The 
latter  pathway  is  sufficient  for  the  viability-maintaining 
expression  of  HIPI  in  the  absence  of  SLN2.  Although 
the  expression  of  HIPI  was  not  extinguished  even  in 
the  absence  of  both  SLN2  and  UBRI  (Fig.  5  A),  it 
became  low  enough  to  result  in  growth  arrest  at  the 
standard  concentration  of  histidine  in  the  medium 
(Fig.  1  A). 

Previous  work  (Byrd  et  al.  1998)  has  shown  that  the 
N-end  rule  pathway  strongly  up-regulates  the  expression 
of  S.  cerevisiae  Ptr2p,  a  peptide  transporter,  through  the 
degradation  of  Cup9p,  a  transcriptional  repressor  of 
PTR2.  We  therefore  tested  whether  the  Ubrlp-depen- 
dent  increase  of  HIPI  expression  in  AVY200  isln2  Pgal- 
UBRI)  cells  (Fig.  5)  involves  the  degradation  of  Cup9p, 
which  is  known  to  either  repress  or  activate  several  yeast 
genes  (G.  Turner,  H.  Rao,  and  A.V.,  unpublished  data). 
A  sln2  cup9A  Pgal-UBRI  strain  was  constructed;  the 
levels  of  HIPI  mRNA  were  compared  between  sln2 
CUP9  and  sln2  cup9A  strains,  and  were  found  to  be 
indistinguishable  (Fig.  5  A).  Thus,  the  effect  of  the  N- 
end  rule  pathway  on  the  expression  of  HIPI  is  unlikely 
to  involve  Cup9p.  These  Northern-hybridization  data 
(Fig.  5  A)  were  also  consistent  with  the  observation  that 
a  deletion  of  CUP9  did  not  rescue  the  inviability  of 
AVY200  (sln2  Pgal-UBRI)cqI\s  on  SD  plates  (data  not 
shown). 


SLN2  is  CIN8 

As  described  above,  both  UBRI  and  HIS3  acted  as  ex- 
tragenic  suppressors  of  the  sln2  mutation.  To  identify 
the  SLN2  gene,  we  searched  for  other  phenotypes  of 
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AVY200  {sln2  Pgal-UBRI)  cells,  and  found  that  they 
were  inviable  at  37  °C.  The  result  of  crossing  AVY200 
with  the  SLN2  AVY199  strain  (Table  1)  indicated  that 
the  phenotype  of  temperature  sensitivity  (ts)  was  reces¬ 
sive  (data  not  shown).  Interestingly,  the  ts  phenotype  of 
sln2  cells  could  not  be  rescued  by  either  U£R1  (Fig.  1  D) 
or  HIS3  (data  not  shown),  indicating  that  the  sln2  mu¬ 
tation  affected  not  only  histidine  uptake.  Since  the  sln2l 
SLN2  (AVY200/AVY199)  diploid  sporulated  ineffi¬ 
ciently,  and  the  viability  of  the  few  spores  that  did  form 
was  very  low,  it  was  difficult  to  preclude,  through  tetrad 
analysis,  the  alternative  possibility  that  sln2  cells  actually 
bore  more  than  one  mutant  gene.  Therefore  we  at¬ 
tempted  to  identify  a  gene(s)  that  would  rescue  the  ts 
phenotype  of  sln2  cells,  and  thereafter  to  determine 
whether  that  gene  could  rescue  the  histidine-uptake  de¬ 
fect  as  well. 

A  standard  complementation  screen  (see  Materials 
and  methods)  yielded  CIN8,  a  previously  described 
gene  encoding  a  kinesin-like  protein  (Hoyt  et  al.  1992; 
Roof  et  al.  1992).  CIN8  was  found  to  alleviate  not 
only  the  ts  phenotype  of  sln2  cells,  but  their  histidine- 
uptake  defect  as  well  (Fig.  6  B),  suggesting  that  the 
sought  SLN2  gene  may  be  CIN8.  To  determine 
whether  the  AVY200  {sln2  ^gal-UBRI)  strain  carried 
a  mutant  allele  of  CIN8,  we  isolated  it  from  these  cells 
using  the  method  of  gap  repair  (Fig.  4  B  and  see 
Materials  and  methods).  Sequencing  of  this  allele, 
termed  CIN8*,  showed  the  presence  of  the  A  T 
mutation  at  position  1237,  resulting  in  a  premature 
stop  codon  (TGA)  at  codon  413  of  the  1038-codon 
CIN8  ORF  (data  not  shown).  The  resulting  truncated 
product  would  lack  the  entire  stalk/tail  region  and  a 
portion  of  the  putative  motor  domain  (Hoyt  et  al. 
1992).  As  expected,  CIN8*,  in  contrast  to  CIN8,  was 
unable  to  rescue  the  histidine-uptake  defect  (inviability 
on  SD  plates)  of  AVY200  cells  (Fig.  6  B).  These  re¬ 
sults  made  it  very  likely  that  the  SLN2  gene  was  in 
fact  CIN8. 

To  produce  independent  evidence  bearing  on  this 
conjecture,  we  deleted  the  bulk  of  CIN8  from  the 
JD52  ihis3A  UBRl)  and  JD54  {his3A  Vgal'UBRI) 
strains  (the  latter  strain  is  Ubrl"  on  dextrose)  (see 
Fig.  4  C  and  Materials  and  methods).  The  resulting 
cinSA  strains  AVY203  {his3A  cin8A  UBRl)  and 
AVY202  (his3A  cin8A  Bgal-UBRI)  (Table  1)  were 
tested  for  viability  on  SD  plates,  i.e.,  for  comple¬ 
mentation  of  the  histidine  uptake  defect.  Whereas  the 
AVY203  {cm8A  UBRl)  cells  were  viable  under  these 
conditions,  the  AVY202  (his3A  cinSA  ^gal-UBRI) 
cells  were  inviable  (Fig.  6  C),  a  set  of  phenotypes  in¬ 
distinguishable  from  that  conferred  by  the  sln2  muta¬ 
tion.  Moreover,  as  would  have  be  expected  if  SLN2 
and  CIN8  were  the  same  gene,  the  HIS3  gene  was 
found  to  rescue  Ubrlp-lacking  cinSA  cells  on  SD 
plates  (Fig.  6  D),  identically  to  the  effect  of  HIS3  on 
sln2  cells  (Fig.  2  A).  Taken  together,  these  results 
(Figs.  4  and  6)  indicated  that  sln2  was  a  defective 
(truncated)  allele  of  the  CJN8  gene. 


Discussion 

We  report  the  following  results. 

(1)  A  synthetic  lethal  screen  for  S.  cerevisiae  mutants 
that  require  the  presence  of  the  N-end  rule  pathway 
(specifically,  the  presence  of  its  recognition  component 
Ubrlp)  for  viability  yielded  a  mutant  termed  sln2.  The 
viability  of  sln2  ubrl  cells  (specifically,  of  sln2  Vgal- 
UBRl  cells  on  dextrose-containing  media)  could  be 
rescued  by  UBRl  expressed  at  either  normal  or  in¬ 
creased  levels. 

(2)  The  HIS3  gene  (which  was  absent  from  the  pa¬ 
rental  “wild-type”  strain)  was  found  to  rescue  the  invi¬ 
ability  of  Ubrlp-lacking  sln2  cells.  High  ( >  75  pg/ml) 
concentrations  of  histidine  in  the  medium  could  also 
rescue  these  cells. 

(3)  This  defect  in  histidine  uptake,  exhibited  by 
Ubrlp-lacking  sln2  cells  but  not  by  SLN2  ubrlA  cells, 
was  traced  to  the  HIPl  gene,  encoding  the  histidine 
transporter.  The  HIPl  allele  in  Ubrlp-lacking  sln2  cells 
was  wild  type,  but  the  level  of  HIPl  mRNA  in  these  cells 
was  reproducibly  about  2-fold  lower  than  in  parental 
(SLN2)  cells.  In  addition,  the  relative  rate  of  '^C-histi- 
dine  uptake  by  Ubrlp-lacking  sln2  HIS3  cells  was  about 
35%  of  that  by  SLN2  HIS3  cells  and  about  50%  of  that 
by  sln2  UBRl  HIS3  cells.  The  incomplete  shutoff  of 
histidine  import  in  Ubrlp-lacking  sln2  cells  was  consis¬ 
tent  with  their  rescue  by  high  concentrations  of  histidine 
in  the  medium.  Further,  the  incomplete  restoration  of 
histidine  import  in  Ubrlp-expressing  sln2  cells  was 
consistent  with  their  detectably  slower  growth  on  SD 
plates  in  comparison  to  either  SLN2  UBRl  or  SLN2 
ubrlA  cells.  Finally,  a  doubling  of  the  HIPl  copy 
number  in  Ubrlp-lacking  sln2  cells  partially  comple¬ 
mented  their  inviability  on  SD  plates.  Thus,  the  histi¬ 
dine-uptake  defect  of  Ubrlp-lacking  sln2  cells  was 
caused  at  least  in  part  by  underexpression  of  Hiplp. 

(4)  Yet  another  property  of  the  sln2  mutant  was  its 
inviability  at  37  °C,  which  could  not  be  rescued  by  either 
HIS3  or  UBRL  This  made  possible  the  cloning  of  SLN2, 
which  was  found  to  be  a  previously  characterized  gene 
called  CIN8,  encoding  a  kinesin-like  protein  (Hoyt  et  al. 
1992;  Roof  et  al.  1992).  The  dependence  of  histidine 
transport  by  cinSA  cells  on  the  presence  of  Ubrlp  is  a 
previously  unknown  property  of  cinSA  cells. 

CinSp  participates  in  the  assembly  of  a  bipolar 
spindle.  cinS  mutants  are  viable,  owing  to  the  presence  of 
Kip  Ip,  another  kinesin-related  motor  protein  and  a 
functional  homolog  of  CinSp;  at  least  one  of  the  two 
proteins  is  required  for  cell  viability  (Hoyt  et  al.  1992; 
Roof  et  al.  1992).  Geiser  et  al.  (1997)  identified  a  num¬ 
ber  of  other  genes  that  are  required  for  viability  in  the 
absence  of  CINS.  These  genes  encode  proteins  that 
function  in  cell-division  pathways  (spindle  motors, 
microtubule  stabilizers,  checkpoint  regulators). 

In  retrospect,  it  was  an  accident  of  the  absence  of 
HIS3  (rather  than  of  another  genetic  marker)  from  the 
parental  JD54  strain  (Pgal-UBRI)  that  led  us  to  the 
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finding,  through  a  synthetic  lethal  screen,  of  a  connec¬ 
tion  between  a  kinesin-Iike  protein,  the  N-end  rule 
pathway,  and  regulation  of  the  histidine  import.  Pre¬ 
liminary  tests  for  the  ability  of  Ubrlp-lacking  sln2  cells 
to  import  either  dipeptides  or  several  amino  acids  other 
than  histidine  indicated  approximately  wild-type  rates  of 
uptake  (data  not  shown). 

A  short-lived  substrate  of  Ubrlp  that  mediates  its 
influence  on  the  phenotype  of  cinSA  {sln2)  cells  is  un¬ 
known.  At  the  same  time,  it  is  clear  that  a  connection 
between  the  N-end  rule  pathway  and  a  CinSp-dependent 
process  involves  the  expression  of  the  histidine  trans¬ 
porter  Hiplp.  A  parsimonious  model  which  is  consistent 
with  the  available  evidence  posits  a  short-lived  tran¬ 
scriptional  repressor  of  the  HIPl  gene  that  is  degraded 
by  the  N-end  rule  pathway,  and  is  also  down-regulated, 
in  an  unknown  manner,  through  a  CinSp-dependent 
pathway  (Fig.  7  A).  This  model  is  analogous  to  the 
previously  deciphered  circuit  that  controls  the  expres¬ 
sion  of  the  peptide  transporter  Ptr2p  in  S.  cerevisiae.  It 
was  found  that  Cup9p,  a  homeodomain-containing 
transcriptional  repressor  of  PTR2,  is  a  short-lived  pro¬ 
tein  targeted  for  Ubrlp-dependent  degradation,  appar¬ 
ently  through  an  internal  degron  of  Cup9p  (Alagramam 
et  al.  1995;  Byrd  et  al.  1998).  In  the  absence  of  Ubrlp, 
the  increased  concentration  of  Cup9p  results  in  a  vir¬ 
tually  complete  shutoff  of  the  PTR2  gene  and  cessation 
of  peptide  import  (Byrd  et  al.  1998).  We  know  (see 
Results)  that  Cup9p  is  not  involved  in  the  regulation  of 
HIPl  expression. 

An  alternative  model  invokes  a  positive  regulator  of 
HIPl  that  is  positively  regulated  by  both  the  N-end  rule 
and  Cin8p-mediated  pathways  (Fig.  7  B).  A  transporter 
homolog  Ssylp  of  S.  cerevisiae  has  been  shown  to  be 
required  for  the  expression  of  several  amino-acid 
transporters,  including  Bap2p,  Bap3p  and  Tatlp,  and 
also  the  peptide  transporter  Ptr2p  (Didion  et  al.  1998). 
Ssylp  appears  to  function  at  least  in  part  as  a  sensor  of 
specific  amino  acids  in  the  medium.  Although  SSYl  is 
unnecessary  for  histidine  uptake  (Didion  et  al.  1998),  it 
is  possible  that  an  analogous  plasma  membrane-em¬ 
bedded  sensor  activates  the  synthesis  of  other  trans¬ 
porters,  including  Hiplp.  The  absence  of  Cin8p  may 
affect  the  secretory  pathway,  resulting  in  lower  amounts 
of  the  postulated  sensor  in  the  plasma  membrane. 
Ubrlp,  the  E3  component  of  the  ubiqui  tin-dependent  N- 
end  rule  pathway,  may  up-regulate  this  sensor;  for  ex¬ 
ample,  through  the  degradation  of  another  E3  enzyme 
that  targets  the  sensor  for  endocytosis  and  degradation 
in  the  vacuole.  The  involvement  of  the  ubiquitin  system 
in  the  regulated  endocytosis  and  lysosome/vacuole  de¬ 
livery  of  transmembrane  proteins  has  previously  been 
demonstrated  in  both  yeast  and  multicellular  eukaryotes 
(Galan  et  al.  1996;  Hicke  and  Riezman  1996). 

One  prediction  of  the  repressor-based  model  (Fig.  7 
A)  is  that  inactivation  of  the  repressor  of  HIPl  should 
suppress  the  down-regulation  of  HIPl  caused  by  the 
absence  of  both  Cin8p  and  Ubrlp.  A  genetic  screen  for 
mutants  of  this  class  is  under  way.  An  analogous  earlier 


Fig.  7  A3  Possible  interconnections  between  the  N-end  rule  path¬ 
way,  the  regulation  of  histidine  import,  and  a  kinesin-dependent 
pathway.  A  In  this,  apparently  most-parsimonious  model,  a  short¬ 
lived  transcriptional  repressor  of  the  HIPl  gene  is  de^aded  by  the  N- 
end  rule  pathway,  and  is  also  down-regulated,  in  an  unknown 
manner,  through  a  CinSp-dependent  pathway.  B  in  an  alternative 
model,  a  positive  regulator  of  HIPl  is  positively  regulated  by  both  the 
N-end  rule  and  CinSp-mediated  pathways 

screen  for  suppressors  of  the  inability  of  ubrlA  cells  to 
import  peptides  has  led  to  the  identification  of  Cup9p  as 
a  transcriptional  repressor  of  the  peptide  transporter 
gene  PTR2  (Byrd  et  al  1998). 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal  residue.  The 
underlying  ubiquitin-dependent  proteolytic  system,  called  the  N-end  rule  pathway,  is  organized  hierarchically: 
N-terminal  aspartate  and  glutamate  (and  also  cysteine  in  metazoans)  are  secondaiy  destabilizing  residues,  in 
that  they  function  through  their  conjugation,  by  arginyl-tRNA-protein  transferase  (R-transferase),  to  arginine, 
a  primary  destabilizing  residue.  We  isolated  cDNA  encoding  the  516-residue  mouse  R-transferase,  ATElp,  and 
found  two  species,  termed  mAAtel-2,  ThtAtel  mRNAs  are  produced  through  a  most  unusual  alternative 
splicing  that  retains  one  or  the  other  of  the  two  homologous  129-bp  exons,  which  are  adjacent  in  the  mouse 
gene.  Human  also  contains  the  alternative  129-bp  exons,  whereas  the  plant  {Ardbidopsis  thaliana)  and  fly 
{Drosophila  melanogaster)  Atel  genes  encode  a  single  form  of  ATElp.  A  fusion  of  ATEl-lp  with  green  fluorescent 
protein  (GFP)  is  present  in  both  the  nucleus  and  the  cytosol,  whereas  ATEl-2p-GFP  is  exclusively  cytosolic. 
Mouse  ATEl-lp  and  ATEl-2p  were  examined  by  expressing  them  in  atel^  Saccharomyces  cerevisiae  in  the 
presence  of  test  substrates  that  included  Asp-pgal  (p-galactosidase)  and  Cys-pgal.  Both  forms  of  the  mouse 
R-transferase  conferred  instability  on  Asp-Pgal  (but  not  on  Cys-pgal)  through  the  arginylation  of  its  N- 
terminal  Asp,  the  ATEl-lp  enzyme  being  more  active  than  ATEl-2p.  The  ratio  of  Atel-1  to  Atel‘2  mRNA  varies 
greatly  among  the  mouse  tissues;  it  is  ^0.1  in  the  skeletal  muscle,  "^0.25  in  the  spleen,  ^3.3  in  the  liver  and 
brain,  and  ~10  in  the  testis,  suggesting  that  the  two  R-transferases  are  functionally  distinct. 


The  half-lives  of  intracellular  proteins  range  from  a  few 
seconds  to  many  days.  The  rates  of  processive  proteolysis  are  a 
function  of  the  cell’s  physiological  state  and  are  controlled 
differentially  for  specific  proteins.  In  particular,  most  of  the 
damaged  or  otherwise  abnormal  proteins  are  metabolically 
unstable.  Many  other  proteins,  while  long-lived  as  components 
of  larger  macromolecular  structures  such  as  ribosomes  and 
oligomeric  proteins,  are  metabolically  unstable  as  free  sub¬ 
units.  Regulatory  proteins  are  often  also  short-lived  in  vivo, 
providing  a  way  to  generate  their  spatial  gradients  and  to 
rapidly  adjust  their  concentrations,  or  subunit  compositions, 
through  changes  in  the  rate  of  their  synthesis  or  degradation 
(20,  23,  28,  39,  44,  55). 

The  posttranslational  conjugation  of  arginine  (Arg)  to  the  N 
termini  of  eukaryotic  proteins  was  described  35  years  ago  (26), 
but  the  function  of  this  modification,  and  of  the  enzyme  in¬ 
volved,  Arg-tRNA-protein  transferase  (R-transferase)  (47),  re¬ 
mained  unknown  until  the  discovery  that  the  identity  of  N- 
terminal  residue  in  a  protein  influences  its  metabolic  stability 
(4).  The  resulting  relation  was  termed  the  N-end  rule  (54). 
Aspartate  (Asp)  and  glutamate  (Glu),  the  two  N-terminal  res¬ 
idues  known  to  be  arginylated  by  R-transferase  (47),  were 
shown  to  be  destabilizing  residues  in  the  N-end  rule  (4).  It  was 
therefore  proposed  (4)  that  the  function  of  R-transferase  is  to 
target  proteins  for  degradation  by  conjugating  Arg,  one  of  the 
primary  destabilizing  residues,  to  secondary  destabilizing  N- 
terminal  residues  (Asp  and  Glu  in  fungi;  Asp,  Glu,  and  Cys  in 
metazoans  (18)  (Fig.  1).  It  was  also  proposed  (4)  that  the 
analogous  prokaryotic  enzyme  Leu,  Phe-tRNA-protein  trans¬ 
ferase  (L,  F-transferase)  (47)  mediates  the  activity  of  N-termi- 
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nal  Arg  and  Lys,  which,  in  prokaryotes,  would  be  the  secondary 
destabilizing  residues.  These  conjectures  were  confirmed  (7, 
17,  45,  53). 

The  similar  but  distinct  degradation  signals  which  together 
give  rise  to  the  N-end  rule  are  called  the  N-degrons  (54, 56).  In 
eukaryotes,  an  N-degron  comprises  two  determinants:  a  desta¬ 
bilizing  N-terminal  residue  and  an  internal  Lys  residue  of  a 
substrate  (5,  22).  The  Lys  residue  is  the  site  of  formation  of  a 
substrate-linked  multiubiquitin  chain  (11).  The  N-end  rule 
pathway  is  thus  one  pathway  of  the  ubiquitin  (Ub)  system.  Ub 
is  a  76-residue  protein  whose  covalent  conjugation  to  other 
proteins  plays  a  role  in  a  multitude  of  processes,  including  cell 
growth,  division,  differentiation,  and  responses  to  stress  (20, 
23,  39,  55).  In  many  of  these  processes,  Ub  acts  through  routes 
that  involve  the  degradation  of  Ub-protein  conjugates  by  the 
26S  proteasome,  an  ATP-dependent  multisubunit  protease  (9, 
13,  40,  43). 

(In  the  text  that  follows,  names  of  mouse  genes  are  in  italics, 
with  the  first  letter  uppercase.  Names  of  human  and  Saccha¬ 
romyces  cerevisiae  genes  are  also  in  italics,  all  uppercase.  If 
human  and  mouse  genes  are  named  in  the  same  sentence,  the 
mouse  gene  notation  is  used.  Names  of  5.  cerevisiae  proteins 
are  roman,  with  the  first  letter  uppercase  and  an  extra  lower¬ 
case  “p”  at  the  end.  Names  of  the  corresponding  mouse  and 
human  proteins  are  the  same,  except  that  all  letters  but  the  last 
“p”  are  uppercase.  The  latter  usage  is  a  modification  of  the 
existing  convention  (50),  to  facilitate  simultaneous  discussions 
of  yeast,  mouse,  and  human  proteins.  In  some  citations,  the 
abbreviated  name  of  a  species  precedes  the  gene’s  name,) 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast  S. 
cerevisiae,  Asn  and  Gin  are  tertiary  destabilizing  N-terminal 
residues  in  that  they  function  through  their  conversion,  by  the 
-encoded  N-terminal  amidase  (Nt-amidase)  (6),  to  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
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FIG.  1.  Comparison  of  enzymatic  reactions  that  underlie  the  activity  of  the 
tertiary  and  secondary  destabilizing  residues  among  eukaryotes.  (A)  Mammals 
(reference  54  and  this  work);  (B)  the  yeast  S.  cerevisiae  (5).  N-terminal  residues 
are  indicated  by  single-letter  abbreviations  for  amino  acids;  ovals  denote  the  rest 
of  a  protein  substrate.  The  Ntanhtncodtd.  mammalian  Nt-amidase  converts 
N-terminal  Asn  to  Asp,  whereas  N-terminal  Gin  is  deamidated  by  a  distinct 
Nt-amidase  that  remains  to  be  identified  (19,  51).  In  contrast,  the  yeast  Nt- 
amidase  can  deamidate  either  N-terminal  Asn  or  Gin  (6).  The  secondary  desta¬ 
bilizing  residues  Asp  and  Glu  are  arginylated  by  the  mammalian  ATEl-lp  or 
ATEl-2p  R-transferase  (see  Results).  A  Cys-specific  mammalian  R-transferase 
remains  to  be  identified  (see  Results).  N-terminal  Arg,  one  of  the  primary 
destabilizing  residues  (54),  is  recognized  by  N-recognin  (E3)  (see  the  introduc¬ 
tion). 


destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  5.  cerevisiae  -encoded  Arg-tRNA- 
protein  transferase  (R-transferase),  to  Arg,  one  of  the  primary 
destabilizing  residues  (7)  (Fig.  IB).  In  mammals,  the  deami¬ 
dation  step  is  bifurcated,  in  that  two  distinct  Nt-amidases  spe¬ 
cific,  respectively,  for  N-terminal  Asn  and  Gin,  mediate  the 
activity  of  tertiary  destabilizing  residues  (19,  51)  (Fig.  lA). 
Mice  lacking  the  Asn-specific  Nt-amidase  NTANlp  have  re¬ 
cently  been  produced  through  targeted  mutagenesis  and  found 
to  be  fertile,  outwardly  normal,  but  behaviorally  distinct  from 
their  congenic  wild-type  counterparts  (28a).  In  mammals,  the 
set  of  secondary  destabilizing  residues  contains  not  only  Asp 
and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in  yeast  (18, 
54)  (Fig.  1). 

The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  -encoded  N-recognin  (also  called  E3a), 
the  recognition  component  of  the  N-end  rule  pathway  (8).  In  k 
cerevisiae,  N-recognin  is  a  225-kDa  protein  that  binds  to  po¬ 
tential  N-end  rule  substrates  through  their  primary  destabiliz¬ 
ing  N-terminal  residues,  Phe,  Leu,  Trp,  Tyr,  He,  Arg,  Lys,  and 
His  (54).  The  Ubrl  genes  encoding  mouse  and  hurnan  N- 
recognins,  also  called  Ea  (21,  41),  have  been  cloned  (29),  and 
mouse  strains  lacking  Ubrl  have  recently  been  constructed 
(29a). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  5.  cerevisiae,  through  the  degra¬ 
dation  of  Cup9p,  a  transcriptional  repressor  of  PTR2  which 
encodes  the  peptide  transporter  (2, 10);  a  role  in  regulating  the 
Slnlp-dependent  phosphorylation  cascade  that  mediates  os¬ 
moregulation  in  5.  cerevisiae  (38);  the  degradation  of  Gpalp,  a 
Ga  protein  of  S.  cerevisiae  (34);  and  the  conditional  degrada¬ 
tion  of  alphaviral  RNA  polymerase  in  virus-infected  metazoan 
cells  (16,  54).  Physiological  N-end  rule  substrates  were  also 
identified  among  the  proteins  secreted  into  the  host  cell’s  cy¬ 
tosol  by  intracellular  parasites  such  as  the  bacterium  Listeria 
monocytogenes  (46).  Inhibition  of  the  N-end  rule  pathway  was 
reported  to  interfere  with  mammalian  cell  differentiation  (24) 
and  to  delay  limb  regeneration  in  amphibians  (52).  Microar¬ 
ray-based  comparisons  of  gene  expression  patterns  in  wild-type 
and  congenic  ubrl  A  strains  of  5.  cerevisiae  have  shown  that  a 
number  of  yeast  genes,  of  diverse  functions,  are  significantly 


up-  or  down-regulated  in  the  absence  of  the  N-end  rule  path¬ 
way  (39a). 

The  mammalian  counterpart  of  the  yeast  -encoded 
R-transferase  was  partially  purified  from  rabbit  reticulocytes 
and  shown  to  cofractionate  with  Arg-tRNA  synthetase  (12). 
Recent  studies  of  the  Ub-dependent  proteolysis  of  endogenous 
proteins  in  muscle  extracts  suggested  that  the  N-end  rule  path¬ 
way  plays  a  major  role  in  catabolic  states  that  result  in  muscle 
atrophy  (48,  49).  A  significant  fraction  of  the  N-end  rule  path¬ 
way’s  activity  in  muscle  extracts  was  found  to  be  tRNA  depen¬ 
dent,  indicating  the  involvement  of  R-transferase  (48,  49).  It 
was  also  reported  that  a  crush  injury  to  the  rat  sciatic  nerve 
results  in  a  ~  10-fold  increase  in  the  rate  of  arginine  conjuga¬ 
tion  to  the  N  termini  of  unidentified  proteins  in  the  nerve’s 
region  upstream  of  the  crush  site  (15, 57),  suggesting  an  injury- 
induced  increase  in  the  concentration  of  R-transferase  sub¬ 
strates  and/or  an  enhanced  activity  of  the  N-end  rule  pathway. 

In  this  work,  we  began  the  functional  analysis  of  mammalian 
R-transferase  (ATElp)  by  isolating  mouse  cDNA  encoding 
this  enzyme.  Surprisingly,  we  found  two  Atel  cDNA  species, 
which  were  identical  except  for  a  129-bp  region  that  encoded 
similar  but  distinct  sequences.  One  or  the  other,  but  not  both, 
of  the  corresponding  y4re7  exons  is  retained  in  the  mature 
mRNA,  and  the  ratio  of  the  resulting  two  species,  Ate  1-1  and 
Atel-2,  varies  greatly  among  mouse  tissues.  We  also  show  that 
ATEl-lp  and  ATEl-2p,  while  differing  in  activity,  can  argin- 
ylate  N-terminal  Asp  and  Glu  in  model  substrates.  However, 
neither  of  them  can  arginylate  N-terminal  Cys,  the  known 
secondary  destabilizing  residue  (18),  suggesting  the  existence 
of  a  distinct  Cys-specific  mammalian  R-transferase. 

MATERIALS  AND  METHODS 

Strains  and  plasmids.  The  S.  cerevisiae  strains  used  were  JD55  {MATa  ura3-52 
his3-A200  leu2-3,112  trpl-^63  lys2-801  ubrl AyJiIS3)  (34)  and  SGY3  (MATa 
ura3-52  lys2‘801  ade2-101  trphA63  his3-A200  Ieu2-Alatel-A2:±EU2)  (19a).  Cells 
were  grown  in  rich  medium  (YPD)  or  in  synthetic  media  (SD)  containing  0.67% 
yeast  nitrogen  base  without  amino  acids  (Difco),  auxotrophic  nutrients,  and  2% 
glucose.  To  induce  the  ^cal  promoter,  glucose  was  replaced  by  2%  galactose 
(SG  medium).  Transformation  of  S.  cerevisiae  was  performed  by  the  lithium 
acetate  method  (3). 

The  ubrl  A  atel  A  double  mutant  AVY34  was  constructed  by  replacing  93%  of 
the  ATEl  open  reading  frame  (ORF)  (the  first  470  codons)  in  strain  JD55 
(^ubrlA)  by  the  LEU2  gene,  through  homologous  recombination  (42)  with  the 
introduced  LEU2  gene  flanked  on  either  side  by  40  bp  of  ylTEZ-specific  se¬ 
quences.  Mutants  were  selected  on  SD  (lacking  Leu  and  His)  plates,  and  Leu"^ 
isolates  were  checked  by  PCR  for  the  absence  of  ATEl  and  by  colony  assays  (7, 
8)  for  the  absence  of  Ate  Ip  and  Ubrlp  activity.  High-copy-number  pUB23-X 
plasmids  expressing  Ub-X-pgal  proteins  (see  below)  from  ?cal  S.  cerevisiae 
have  been  described  elsewhere  (4).  Mouse  Atel -1  and  Atel-2  cDNAs  (see  below) 
were  subcloned  into  the  low-copy-number  vector  p414GALl  (36),  using  the 
engineered  BamVil  (5')  and  Xhol  (3')  restriction  sites,  yielding  plasmids  pATl 
and  pAT2.  For  localization  assays  with  green  fluorescent  protein  (GFP),  cDNAs 
encoding  mouse  ATEl-lp  or  ATEl-2p  were  subcloned  into  the  pEGFP-Nl 
N-terminal  protein  fusion  vector  (Clontech,  Palo  Alto,  Calif.),  using  the  engi¬ 
neered  ATzoI  (5')  and^gel  (3')  restriction  sites,  yielding  plasmids  pATl-GFP  and 
PAT2-GFP. 

Isolation  of  the  mouse  Atel-1  and  Atel-2  cDNAs.  The  392-bp  fragment  of  the 
mouse  EST  (expressed  sequence  tag)  clone  (accession  no.  AA415294),  which  was 
identified  in  GenBank  through  species  walking  (see  Results),  was  used  as  a  probe 
to  screen  a  \gtlO-based  mouse  cDNA  library  from  MEL-C19  cells  (Clontech), 
using  standard  procedures  (3).  Eight  positive  clones,  whose  inserts  ranged  from 
0.5  to  1.6  kb,  were  analyzed  by  PCR  and  partial  sequencing.  The  cDNA  inserts 
of  clones  3  and  8  were  then  subcloned  into  pBluescript  II  SK"^  (29)  and  se¬ 
quenced  on  both  strands.  The  resulting  ORFs  were  identical  except  for  a  129-bp 
internal  region  (see  Results)  (Fig.  2A).  The  deduced  amino  acid  sequences  of  the 
mouse  cDNA  clones  3  and  8  were  weakly  but  significantly  similar  to  the  deduced 
sequences  of  Caenorhabditis  elegans  and  5.  cerevisiae  ATElp  (see  Results)  (Fig. 
2B)  and  corresponded  to  nucleotides  (nt)  699  to  1870  and  587  to  2099,  respec¬ 
tively,  in  the  subsequently  produced  full-length  mouse  Atel-1  and  Atel-2  cDNAs 
(accession  no.  AF079096  and  AF079097). 

A  human  EST  clone  (accession  no.  AA503372)  whose  deduced  amino  acid 
sequence  was  highly  similar  to  that  of  the  partial  mouse  v4re7  cDNA  clone  8  was 
found  in  GenBank,  using  the  partial  mouse  ATElp  sequence  as  a  query.  This 
EST  clone  was  purchased  from  Genome  Systems  (St.  Louis,  Mo.)  and  sequenced 
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to  obtain  more  of  the  S'-proximal  human /I TE/  sequence.  Reverse  transcription- 
PCR  (RT-PCR)  (3)  was  then  carried  out  with  poly(A)‘^  RNA  from  mouse 
embryonic  fibroblasts,  using  the  mouse  Atel-sptcxfic  reverse  primer  S'-CCTTT 
GGTAACAAACAGACTGGCTG-3'  and  the  forward  primer  S'-TCTCATAG 
ACCGAGGATGGCGAAG'3',  whose  sequence  was  derived  from  the  above 
human  EST  clone.  The  resulting  PCR  products,  which  appeared  as  a  smear  upon 
agarose  gel  electrophoresis,  were  ligated  into  the  TA  cloning  vector  (Invitrogen, 
San  Diego,  Calif.),  and  the  ligation  mbcture  was  used  as  a  template  for  PCR  using 
a  nested  mouse  Atel  primer  5  '-CTGCAGCTGAGGCCTGCTGCATCCG-3  ^ 
and  a  vector-specific  primer  5 ' -G  i  1 1 1 CCCAGTC ACG AC-3 ' .  This  strategy 
yielded  a  single  major  DNA  species  (data  not  shown).  We  then  applied  5 '-RACE 
(rapid  amplification  of  5'  cDNA  ends)  (3),  using  the  above  RT-PCR-derived 
sequence,  to  produce  the  full-length  cDNA  as  previously  described  (19). 

Analysis  of  the  mouse  Atel  gene.  The  mouse  genomic  DNA  from  L  cells  was 
used  as  a  template  for  PCR,  using  the  Expand  high-fidelity  PCR  system  (Boehr- 
inger,  Indianapolis,  Ind.)  and  exon-specific  primers  as  previously  described  (29), 
to  produce  DNA  fragments  that  together  spanned  --4  kb  of  the  mouse  Atel  gene 
and  contained  the  two  alternative  129-bp  ^4/^7  exons.  The  regions  encompassing 
exon/intron  junctions  were  sequenced  by  using  exon-  and  intron-specific  primers. 
Thereafter,  a  strategy  described  earlier  for  the  Ubrl  gene  (29)  was  used  to  screen, 
using  a  fragment  of  the  mouse  Atel  cDNA  (nt  255-1139),  a  BAC  (bacterial 
artificial  chromosome)-based  library  of  mouse  genomic  DNA  fragments  from 
strain  129SvJ  (Genome  Systems),  yielding  one  BAC  clone  containing  the  mouse 
Atel  gene. 

Isolation  of  the  human,  plant,  and  fiy  ATEl  cDNAs.  Using  the  cloned  mouse 
ATElp  sequences  (see  above)  as  queries,  we  identified  in  GenBank  several 
significantly  similar  EST  sequences  from  other  organisms  (data  not  shown).  To 
determine  whether  these  species  also  contained  the  two  forms  oiATEl  mRNA, 
we  isolated  the  corresponding  ^r£7  cDNAs.  RT-PCR  (3)  with  poly(A)‘^  RNA 
from  human  293  cells  and  the  primers  5'-CAATGGCATGTGGGCACATTCC 
ATG-3'  (specific  for  the  human  EST  clone  AA503372  [see  above])  and  5'-CC 
ACAGGTACTGAATATGTATCCTG-3'  (specific  for  the  human  EST  clone 
AA195361)  was  carried  out,  yielding  a  1.6-kb  human  ATEl  cDNA  fragment 
lacking  the  first  41  codons  of  the.^7'£7  ORF.  This  fragment  (a  mixture  of  the  two 
alternative  cDNAs)  was  subcloned  into  the  TA  vector  (Invitrogen)  and  se¬ 
quenced  on  both  strands.  Full-length  y4?e7  cDNAs  trom  Arabidopsis  thaliana  and 
Drosophila  melanogaster  wqtc  isolated  by  RT-PCR  as  well,  using  total  RNA  from 
A.  thaliana  leaves,  poly(A)'^  RNA  from  D,  melanogaster  embryos,  and  primers 
specific  for  the  5'  and  3'  ends  of  the  corresponding  ORFs.  By  using  the  strategy 
described  above  for  the  human  ^r£7  cDNAs,  the  sequences  of  these  primers 
were  derived  fi-om  the  EST  clones  that  were  initially  identified  in  GenBank  through 
their  similarity  to  the  mouse  ATElp  sequence,  then  purchased  from  Genome  Sys¬ 
tems,  and  sequenced  prior  to  RT-PCR  with  the  corresponding  RNA  preparations. 
The  final  human,  plant,  and  ^yATEl  cDNAs  were  sequenced  on  both  strands. 

Assays  of  (1-gal.  Colony  assays  for  the  Escherichia  coli  3-gaIactosidase  (Pgal) 
in  S.  cerevisiae  were  carried  out  by  overlaying  yeast  colonies  on  SG  plates  with 
0,5%  agarose  containing  0.1%  sodium  dodecyl  sulfate  (SDS),  4%  dimethylfor- 
mamide,  and  a  0.1-mg/mI  solution  of  the  chromogenic  pgal  substrate  X-Gal 
(5-bromo-4-chloro-3-indolyl-3-D-galactopyranoside;  Calbiochem,  La  Jolla,  Cal¬ 
if.),  followed  by  incubation  for  1  to  2  h  at  37°C.  Quantitative  assays  for  Pgal  in 
S.  cerevisiae  were  carried  out  with  whole-cell  extracts,  using  another  chromogenic 
pgal  substrate,  o-nitrophenyl-p-o-galactopyranoside  (ONPG).  Cells  in  a  5-ml 
culture  (^4^00  of  ~1)  were  pelleted  by  centrifugation  and  resuspended  in  5  ml  of 
buffer  Z  (60  mM  Na2HP04,  40  mM  NaH2P04.  10  mM  KCl,  1  mM  MgS04,  50 
mM  3-mercaptoethanoI  [pH  7.0]).  After  the  A^  of  the  suspension  was  deter¬ 
mined  50-  or  lOO-pl  samples  were  diluted  to  1  ml  with  buffer  Z;  0.1%  SDS  (20 
ixl)  and  CHCI3  (50  p.1)  were  then  added;  the  suspension  was  vortexed  for  10  to 
15  s  and  incubated  for  15  min  at  30°C,  followed  by  the  addition  of  200  jxl  of 
ONPG  (4  mg/ml  in  buffer  Z)  and  further  incubation  at  30°C,  until  a  medium 
yellow  color  had  developed,  at  which  point  the  reaction  was  stopped  by  the 
addition  of  1  M  Na2C03  (0.4  ml).  The  mixture  was  centrifuged  for  5  min  at 
1,100  X  g,  and  they442o  and  >1500  of  the  samples  were  measured.  The  ONPG  units 


(Lqnpg)  of  Pgal  activity  were  calculated  as  follows:  Uqnpg  =  L900  x  [(^420)  - 
(1.75  X  /45oo)]/t  X  V  X  A^oq,  where  t  and  v  were,  respectively,  the  time  of 
incubation  (minutes)  and  the  sample  volume  (milliliters)  (3). 

Purification  and  N-terminal  sequencing  of  X-|}gai  proteins.  Extracts  were 
prepared  (using  the  liquid  nitrogen  procedure  [3])  from  S.  cerevisiae  AVY34 
{ubrl  A  atel  A)  cotransformed  with  a  pUB23-X  plasmid  (4)  (expressing  Ub-X- 
pgal)  and  either  pATl  (expressing  mouse  ATEl-lp)  or  pATZ  (expressing  mouse 
ATEl-2p).  Cultures  were  grown  in  SG  to  2inA^  of  ~1,  Specific  X-pgal  proteins 
(X  =  Asp  or  Cys)  were  purified  by  affinity  chromatography  on  ProtoSorb  lacZ 
(Promega,  Madison,  Wis.),  a  monoclonal  anti-^gal  antibody  coupled  to  agarose 
beads  (Promega).  X-3gal  proteins  were  further  purified  by  electrophoresis  on 
SDS-7%  polyacrylamide  gels  and  were  electroblotted  onto  Immobilon-P®^ 
membranes  (Millipore,  Bedford,  Mass.).  N-terminal  sequencing  of  10  to  15  pmol 
of  electroblotted  X-pgal  was  carried  out  for  at  least  five  cycles,  using  an  Applied 
Biosystems  476A  protein  sequencer  (Caltech  Microchemistry  Facility). 

Mouse  cell  cultures,  transfection,  and  GFP  localization.  NIH  3T3  cells  (ATCC 
1658-CRL)  were  grown  as  monolayers  in  Dulbecco’s  modified  Eagle  medium 
(GIBCO,  Frederick,  Md.)  supplemented  with  10%  fetal  bovine  serum.  Cells  for 
GFP  localization  analyses  were  grown  to  ~15%  confluence  on  glass  coverslips 
for  24  h  prior  to  transfection  with  either  pATl-GFP  or  pAT2-GFP,  using  Lipo- 
fectamine  (GIBCO)  and  the  manufacturer-supplied  protocol.  Cells  were  incu¬ 
bated  for  5  h  at  37®C  in  serum-free  medium  containing  DNA  and  Lipofectamine. 
Thereafter  an  equal  volume  of  medium  containing  20%  serum  was  added,  and 
the  cells  were  grown  for  another  12  to  20  h  at  37*C.  Cells  were  fixed  with  2% 
formaldehyde  in  phosphate-buffered  saline,  and  GFP  fluorescence  was  visualized 
in  a  Zeiss  Axiophot  microscope. 

Northern  hybridization.  Mouse  multiple-tissue  Northern  blots  containing  2  jxg 
of  poly(A)'*'  RNA  per  lane  (Clontech)  were  probed  with  the  ^^P-labeled  1.1-kb 
mouse  Atel  cDNA  (nt  638  to  1734),  using  the  manufacturer-supplied  protocol. 

Determination  of  the  relative  levels  of  Atel-1  and  AteD2  mRNAs.  Samples  of 
total  RNA  isolated  as  described  previously  (3)  from  mouse  spleen,  skeletal 
muscle,  liver,  brain,  testis,  and  embryonic  fibroblasts  were  subjected  to  RT-PCR 
(28  cycles).  The  primers  5'-CAGTGGAGGATGCTGTTGACGGTGAC-3'  and 
5'-GTGCTCTGCCTCCAATGGTGAGCTG-3'  were  specific  for  the  identical 
regions  of  Atel-1  and  Atel-2  cDNAs  that  flanked  the  two  129-bp  exons  (see 
Results)  which  distinguished  these  cDNAs.  The  resulting  624-bp  product  (a 
mbcture  of  Xht  Atel-1  Atel-2  cDNA  fragments)  was  treated  with  ScrFl,  which 
cuts  at  different  sites  within  the  two  129-bp  exons,  followed  by  a  2%  agarose  gel 
electrophoresis.  This  procedure  made  it  possible  to  distinguish  the  Atel-1  and 
Atel-2  fragments.  The  ratios  of  the  two  forms  of  Atel  cDNA  were  determined  by 
serial  dilutions  of  the  samples  prior  to  gel  electrophoresis. 

Nucleotide  sequence  accession  numbers.  The  nucleotide  sequences  reported 
in  this  paper  were  submitted  to  the  GenBank/EMBL  data  bank  and  assigned 
accession  no.  AF079096  (mouse  Atel-1  cDNA),  AF079097  (mouse  Atel-2  cDNA), 
AF079098  (human  v4/e7-7  cDNA),  AF079099  (human  ^r£7-2  cDNA),  AF079100 
{A.  thaliana  Atel  cDNA),  and  AF079101  {D.  melanogaster  Atel  cDNA). 

RESULTS 

Identification  of  mouse  Atel  cDNAs  by  species  walking.  On 
the  assumption  that  the  sequences  of  R-transferases  in  differ¬ 
ent  species  might  be  sufficiently  conserved  to  be  detected  by 
using  the  sequence  of  the  only  cloned  R-transferase,  S.  cerevi¬ 
siae  Ate  Ip  (7),  we  have  been  searching  GenBank  and  related 
databases.  No  mammalian  sequences  in  GenBank,  including 
the  EST  sequences,  had  significant  similarities  to  S.  cerevisiae 
Ate  Ip.  However,  we  did  identify  a  nematode  (C  elegans)  ORF 
(accession  no.  Z21146)  that  exhibited  similarity  to  yeast  Atelp 
(Fig.  2B)  and  then  used  the  C.  elegans  sequence  to  identify  a 


FIG.  2.  Two  forms  of  the  mouse  v4/e7  cDNA  and  the  ATE  protein  family.  (A)  The  mouse  ^/e7-7  2ind  Atel-2  cDNAs  and  their  products.  The  nucleotide  sequences 
of^re7-2  identical  to  those  of  Atel-1  (everywhere  except  for  the  129-bp  region)  are  indicated  by  dashes.  In  the  region  of  the  alternative  129-bp  exons  of  Atel-1  and 
Atel-2,  white-on-black  and  gray  shadings  highlight,  respectively,  identical  and  similar  residues.  The  circled  Cys  residues  are  homologous  to  those  that  are  important 
for  the  enzymatic  activity  of  5.  cerevisiae  Atelp  (32).  (B)  The  ATE  protein  family  and  the  origins  of  the  alternative  129-bp  exons.  Alignment  of  the  sequences  of  mouse 
ATEl-lp  (Mm-Atel-1),/1.  thaliana  Atelp  (At-Atel),  D.  melanogaster  Atelp  (Dm-Atel),  C,  elegans  Atelp  (Ce-Atel),  and  S.  cerevisiae  Atelp  (Sc-Atel)  (accession  no, 
J05404).  Similar  residues  (gray)  were  grouped  as  follows:  M,  L,  I,  and  V;  D,  E,  N,  and  Q;  R,  K,  and  H;  Y,  F,  and  W;  S,  A,  and  T.  The  region  encoded  by  the  alternative 
129-bp  exons  of  mouse  Atel  is  highlighted  by  a  thick  line.  Of  the  Cys  residues  that  are  conserved  among  all  ATE  proteins,  the  ones  required  and  not  required  for  the 
enzymatic  activity  of  5.  cerevisiae  Atelp  (32)  are  indicated,  respectively,  by  ▼  and  V.  The  N-terminally  truncated  mouse  ATEl-lp  and  ATEl-2p  proteins  which  began 
at  Met-42  (•)  lacked  the  R-transferase  activity  (data  not  shown).  The  highly  variable  C-terminal  regions  of  ATE  proteins  were  omitted  from  the  alignment.  The 
sequences  were  aligned  using  PileUp  program  (Wisconsin  Package:  Genetics  Computer  Group,  Madison,  Wis.).  Gaps  (-)  were  introduced  to  optimize  the  alignment. 
The  residue  numbers  are  on  the  right  of  the  sequences.  The  sequence  of  C.  elegans  Atelp  appears  to  lack  the  N-terminal  region  of  other  ATE  proteins  because  of  an 
error  in  defining  xhtAtel  ORF  in  the  genomic  DNA  sequence  (accession  no.  Z21146).  (C)  Alignment  of  the  43-residue  regions  that  are  encoded  by  the  alternative 
129-bp  exons  in  mammalian  .4/e7.  Sequences  shown:  mouse  (Mm-Atel-1  and  Mm-Atel-2),  human  (Hs-Atel-1  and  Hs-Atel-2),  D.  melanogaster  (Dm-Atel),  C.  elegans 
{Cc-\xe\),  A.  thaliana  (At-Atel),  S.  pombe  (Sp-Atel:  accession  no.  Z99568),  and  5.  cerevisiae  (Sc-Atel)  (accession  no.  J05404).  The  degrees  of  identity  and  similarity 
of  ATE  proteins  to  the  deduced  amino  acid  sequences  of  the  M8  (mouse  ATEl-lp)  or  M3  (mouse  ATEl-2p)  exon  of  the  mouse  y4te7  gene  are  indicated  on  the  right. 
The  residues  conserved  among  all  of  the  compared  sequences  are  indicated  above  the  alignment. 
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FIG.  3.  Expression  oiAtel-1  and  Ate2~2  mRNAs  in  different  mouse  tissues 
and  mouse  embryonic  fibroblasts  (MEF).  Two  forms  of  Atel  cDNA  (Fig.  2A) 
were  amplified  in  a  single  reaction  by  RT-PCR,  using  the  same  primers,  to  yield 
624-bp  fragments  that  included  the  region  of  the  alternative  129-bp  exons  (see 
Materials  and  Methods).  The  624-bp  fragments  were  digested  with  5crFI,  which 
produced  a  231-bp  fragment  (a  mixture  oiAtel-1  and  AteU2\  a  284-bp,  Atel- 
/-specific  fragment,  and  a  317-bp, /Irei -2-specific  fragment.  The  untreated  (lanes 
a)  and  5crFI-treated  (lanes  b)  samples  from  different  mouse  tissues  were  ana¬ 
lyzed  by  electrophoresis  in  a  2%  agarose  gel.  The  ratio  of  the  two  forms  of  Atel 
mRNA,  defined  as  the  ratio  of  the  284-bp  (Atel-l)  fragment  to  the  317-bp 
(AteJ-2)  fragment,  was  determined  by  analyzing  serially  diluted  samples  and 
comparing  the  resulting  band  intensities  (data  not  shown),  sk.,  skeletal. 


significantly  similar  EST  sequence  of  D.  melanogaster  cDNA 
(accession  no.  AA391570).  Finally,  using  the  deduced  amino 
acid  sequence  of  the  Drosophila  EST  clone  as  a  probe,  we 
identified  a  467-bp  mouse  EST  sequence  (accession  no. 
AA415294)  that  exhibited  weak  but  significant  similarity  to  the 
Drosophila  sequence  but  no  detectable  similarity  to  S,  cerevi- 
siae  Ate  Ip.  On  a  chance  that  this  467-bp  EST  had  been  derived 
from  the  mouse  Atel  cDNA,  we  used  it  to  screen  a  mouse 
cDNA  library  and  indeed  isolated  the  putative  mouse  Atel 
cDNAs  (Fig.  2A). 

Alternative  splicing  results  in  two  species  of  mouse  Atel 
cDNA  containing  distinct  but  homologous  exons.  During  the 
initial  mouse  cDNA  library  screening,  we  found  that  the  cDNA 
clone  3  (nt  699  to  1870  ckAtel-2  cDNA)  was  identical  to  the 
cDNA  clone  8  (nt  587  to  2099  oiAtel-1  cDNA),  except  for  a 
129-bp  region  whose  deduced  amino  acid  sequences  were  sim¬ 
ilar  (31%  identity;  61%  similarity)  (Fig.  2).  The  two  full-length 
Atel  cDNAs  (termed  ylfei-i  and^fei-2),  which  were  obtained 
by  RT-PCR  followed  by  5 '-RACE  (see  Materials  and  Meth¬ 
ods),  encoded  proteins  of  identical  length,  516  residues  (59.2 
kDa  and  pi  of  8.14  versus  59.1  kDa  and  pi  of  7.22)  (Fig.  2A), 
that  contained  regions  of  similarity  to  the  57.8-kDa  Atelp  of  5. 
cerevisiae  (Fig.  2B).  RT-PCR  (followed  by  subcloning)  with 
RNAs  from  different  mouse  tissues  also  produced  the  two 
forms  of  Atel  cDNAs,  indicating  that  the  two  species  were  in 
fact  present  in  the  initial  RNA  preparation  (Fig.  3  and  data  not 
shown). 

To  determine  whether  both  of  the  two  129-bp  regions  of  the 
Atel-1  and  Atel -2  cDNAs  were  a  part  of  the  Atel  gene,  and 
whether  d.ndAtel-2  were  produced  through  alternative 

splicing,  we  analyzed  the  mouse  Atel  gene  in  the  vicinity  of  its 
two  129-bp  exons,  using  at  first  PCR  and  subsequently  a  BAC 
clone  containing  Atel  (see  Materials  and  Methods).  The  two 
129-bp  exons  were  located  next  to  each  other  in  the  Atel  gene 
(Fig.  4A).  We  also  found  that  the  12-bp  sequences  around  the 
splice  acceptor  sites  of  these  exons  (6  bp  in  the  intron  and  6  bp 
in  the  exon)  were  identical  between  the  two  exons  (Fig.  4B), 
consistent  with  the  alternative  presence  of  these  exons  in  the 
mature  Atel  mRNA.  The  exon-containing  RT-PCR  products 
from  different  mouse  tissues  appeared  as  a  single  major  band 
retaining  one  of  the  two  129-bp  exons  (Fig.  3  and  data  not 
shown).  Subcloning  and  analyses  of  these  RT-PCR  products 
yielded  no  other  differentially  spliced  Atel  cDNAs  (for  exam¬ 
ple,  cDNAs  retaining  both  or  neither  of  the  two  129-bp  exons). 
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FIG.  4.  The  two  forms  of  mouse  Atel  mRNA  are  produced  by  alternative 
splicing.  (A)  The  two  alternative  129-bp  exons  are  adjacent  in  the  mouse  Atel 
gene.  The  thick  line  denotes  genomic  DNA;  the  striped  and  black  rectangles 
denote  the  alternative  129-bp  exons,  M8  and  M3  (see  Materials  and  Methods); 
gray  rectangles  denote  the  flanking  Atel  exons,  of  unknown  sizes;  thin  lines 
denote  the  alternative  splicing  patterns  that  yield  the  two  forms  of  Atel  mRNA. 
(B)  The  underlined  12  bp  (6  bp  in  the  intron  and  6  bp  in  the  exon)  around  the 
splice  acceptor  sites  are  identical  between  the  two  alternative  129-bp  exons.  (C) 
Scale  diagrams  of  the  two  forms  of  moose  Atel  cDNAs.  The  alternative  129-bp 
exons  M8  and  M3  are  indicated  by  the  striped  and  black  boxes,  respectively. 


suggesting  that  the  splicing  oiAtel  pre-mRNA  is  tightly  regu¬ 
lated  to  retain  one  and  only  one  of  the  two  alternative  exons. 
Thus,  the  two  forms  of  Atel  mRNAs  are  produced  by  a  nearly 
unprecedented  (see  Discussion)  splicing  pathway  which  ulti¬ 
mately  yields  two  proteins  of  identical  size  that  bear  two  alter¬ 
native,  homologous  but  distinct  43-residue  internal  sequences 
(Fig.  4C). 

The  absence  of  alternative  129-bp  exons  from  the  plant  and 
fiy  Atel  genes.  To  explore  the  evolution  of  Atel,  and  especially 
the  phytogeny  of  its  alternative  129-bp  exons,  we  cloned  the 
human,  plant  {A.  thaliana),  and  fly  (D.  melanogaster)  ATEl 
cDNAs  (see  Materials  and  Methods).  Two  forms  of  the  human 
ATEl  cDNA,  termed  Hs-ATEl-1  and  Hs-ATED2,  were  iso¬ 
lated  from  human  293  cells  (the  forms’  molar  ratio  was  about 
1).  However,  only  one  form  of  the  Atel  cDNA  was  isolated 
from  either  the  leaves  of  A.  thaliana  (termed  At-Atel)  or  D, 
melanogaster  embryos  (termed  Dm-Atel),  suggesting  that  the 
alternative  129-bp  exons  may  not  be  present  in  thoAtel  genes 
of  plants  and  arthropods.  The  A.  thaliana  and  D,  melanogaster 
Atelp  proteins  were,  respectively,  629  and  477  residues  long 
(71  and  55  kDa,  with  pis  of  6.0  and  8.4).  Mouse  ATEl-lp  was 
82,  38,  and  42%  identical  (as  well  as  91,  57,  and  61%  similar) 
to  human  ATETlp,  A.  thaliana  Atelp,  and  D.  melanogaster 
Atelp,  respectively  (Fig.  2  and  data  not  shown).  A.  thaliana 
Atelp  bore  a  16-residue  region  containing  exclusively  Asp  or 
Glu  (data  not  shown). 

We  used  RT-PCR  and  RNA  preparations  from  A.  thaliana 
and  D.  melanogaster  to  amplify  the  relevant  regions  of  the 
corresponding  Atel  cDNAs.  The  resulting  fragments  were  di¬ 
gested  with  restriction  enzymes  that  recognize,  in  each  species, 
exclusively  the  region  that  corresponds  to  the  129-bp  exons  of 
the  mouse  Atel  cDNAs,  and  the  products  were  analyzed  by  gel 
electrophoresis.  The  initial  cDNA  fragments  ofyl.  thaliana  and 
D.  melanogaster  Atel  completely  disappeared  after  this  treat¬ 
ment,  in  contrast  to  the  homologous  mouse  cDNA  fragment 
(which  contained  two  distinct  sequences  of  identical  length), 
suggesting  that  the  two  alternative  exons  were  absent  from  the 
Atel  genes  of  plants  and  arthropods  (Fig.  5A). 

While  this  analysis  was  under  way,  complete  sequences  of 
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FIG.  5.  Alternative  splicing  of  Ate  J  pre-mRNA  in  mammals  (the  mouse)  but  in  neither  plants  (A.  thaliana)  nor  arthropods  (D.  melanogaster).  (A)  The  relevant  >4/^7 
cDNA  fragments  from  mouse  (794  bp).  A.  thaliana  (626  bp),  and  D.  melanogaster  (856  bp)  were  produced  by  RT-PCR  (see  Materials  and  Methods).  The  products  were 
treated  with  the  indicated  restriction  endonucleases  that  cut  exclusively  within  the  two  alternative  129-bp  exons  of  mouse  cDNAs  {BspMf  fox  Ate  I -2\  SexM  for 
Atel-1)  or  within  the  corresponding  regions  of  A.  thaliana  {Fvull  and  5crFI)  and  D.  melanogaster  {Accl  and  EcoRV)  Atel  cDNAs.  (B)  The  A.  thaliana  and  D. 
melanogaster  Atel  genes.  The  exon-intron  organization  of  these  genes  was  deduced  through  comparisons  of  their  cDNA  sequences,  determined  in  this  work  (see 
Materials  and  Methods),  with  the  concurrently  determined  sequences  of  the  corresponding  genomic  DNA  regions  (see  text).  The  horizontal  lines  and  rectangles  denote, 
respectively,  introns  and  exons,  whose  lengths  are  indicated  below  and  above  the  line  denoting  introns.  Thick  horizontal  lines  indicate  the  regions  of  A.  thaliana  and 
D.  melanogaster  cDNAs  that  correspond  to  the  alternative  129-bp  exons  of  the  mouse  and  human  Atel  cDNAs  (Fig.  2  and  4).  The  lengths  of  the  A.  thaliana  and  D. 
melanogaster  Atel  genes  are,  respectively,  ~3  and  -“2.5  kb. 


the  A.  thaliana  and  D.  melanogaster  Atel  loci,  determined 
through  the  corresponding  sequencing  projects,  were  depos¬ 
ited  in  GenBank  (accession  no.  AA005237  and  accession  no. 
AC004321,  respectively).  By  comparing  the  cloned  Atel  cDNAs 
(see  Materials  and  Methods)  and  the  corresponding  genomic 
sequences  of  A.  thaliana  and  D.  melanogaster,  we  could  deduce 
the  organization  of  these  Atel  genes.  The  results  (Fig.  5B) 
directly  confirmed  the  absence  of  the  alternative  homologous 
exons  iiomAtel  of  A.  thaliana  andD.  melanogaster,  in  contrast 
to  mammalian  The  corresponding  region  of  plant  Ate  Ip 
is  more  similar  to  the  exon-encoded  sequence  of  mouse  ATEl- 
Ip,  whereas  in  Drosophila  this  region  is  more  similar  to  the 
alternative  sequence  of  ATEl-2p  (Fig.  2C).  The  corresponding 
regions  of  5.  cerevisiae  and  Schizosaccharomyces  pombe  Ate  Ip 
are  not  preferentially  similar  to  either  of  the  two  alternative 
exon-encoding  sequences  of  mouse  ATElp  (Fig.  2C). 

Mouse  ATEl-lp  and  AT£l-2p  can  implement  the  Asp/Glu- 
specific  subset  of  the  N-end  rule  pathway  but  differ  in  activity. 
To  determine  whether  the  two  putative  mouse  R-transferases 
are  in  fact  R-transferases  and  to  compare  their  activities  in  an 
in  vivo  setting,  we  examined  whether  ATEl-lp  and  ATEl-2p 
could  confer  metabolic  instability  on  Asp-Pgal  and  Glu-(3gal  in 
atel  A  S.  cerevisiae.  Asp  and  Glu  are  secondary  destabilizing 
residues  in  the  N-end  rule  (Fig.  1  and  introduction).  The  test 
substrates  Asp-pgal  and  Glu-(3gal  (produced  through  cotrans- 
lational  deubiquitylation  of  Ub-Asp-pgal  and  Ub-Glu-pgal 
(4))  are  short-lived  in  wild-type  yeast  (half-lives  of  '^3  and  —30 
min,  respectively)  but  long-lived  (half-life  of  >20  h)  in  <27^7  A  S. 
cerevisiae  that  lacks  the  A  TEi -encoded  yeast  R-transferase  (5, 
7).  Previous  work  (19,  33)  has  shown  that  the  steady-state  level 
of  an  X-pgal  protein  is  a  sensitive  measure  of  its  metabolic 
stability. 

5.  cerevisiae  atel  ilk  cells  were  cotransformed  with  a  pair  of 


plasmids  that  expressed  one  of  the  two  putative  mouse  R- 
transferases,  ATEl-lp  or  ATEl-2p,  and  one  of  several  test 
substrates  (as  the  corresponding  IJb  fusions):  Asp-pgal,  Glu- 
pgal,  Arg-pgal,  Cys-Pgal,  or  Met-Pgal.  Met  and  C^s  are  stabi¬ 
lizing  residues  in  the  yeast  N-end  rule;  Arg  is  a  primary  desta¬ 
bilizing  residue;  Asp  and  Glu  are  secondary  destabilizing 
residues  (5,  56).  Control  tests  included  either  the  vector  alone 
or  a  plasmid  expressing  S.  cerevisiae  Atelp.  The  steady-state 
levels  of  X-Pgal  proteins  were  determined  by  measuring  the 
enzymatic  activity  of  pgal  in  yeast  extracts.  Using  this  assay,  we 
found  that  both  forms  of  mouse  ATElp  were  able  to  confer 
metabolic  instability  on  either  Asp-pgal  or  Glu-Pgal  in  atel  is  S. 
cerevisiae  (Fig.  6A).  ATEl-lp  and  ATEl-2p  destabilized  Glu- 
pgal  much  less  than  Asp-pgal  (Fig.  6A),  consistent  with  Glu 
being  a  less  destabilizing  residue  in  the  N-end  rule  than  Asp, 
presumably  because  of  less  efficient  arginylation  of  the  N- 
terminal  Glu  by  R-transferases  (54).  However,  while  the  ap¬ 
parent  destabilizing  activity  of  the  mouse  ATEl-lp  R-trans- 
ferase  was  only  slightly  lower  than  that  of  S.  cerevisiae  Atelp 
(expressed  from  the  identical  vector  and  promoter),  the  activ¬ 
ity  of  mouse  ATEl-2p  was  significantly  lower  than  that  of 
ATEl-lp  (Fig.  6A). 

We  also  asked  whether  the  two  forms  of  mouse  ATElp 
could  influence  each  other’s  activity  if  they  were  coexpressed  in 
the  same  cell  (such  an  influence  might  be  expected,  for  in¬ 
stance,  if  the  active  form  of  R-transferase  were  a  dimer  or  if 
the  two  forms  of  R-transferase  competed  for  binding  to  the 
same  component  of  a  pathway).  S.  cerevisiae  atel  is  cells  were 
cotransformed  with  two  plasmids  bearing  different  selectable 
markers  and  expressing  different  combinations  of  ATEl-lp 
and  ATEl-2p  (1  +  1,  2+2,  or  1+2),  and  also  with  a  plasmid 
expressing  one  of  the  X-Pgal  test  proteins  (X  =  Met,  A^rg,  Cys, 
Asp,  or  Glu).  Control  cells  were  cotransformed  with  the  two 
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FIG.  6.  The  two  forms  of  mouse  ATElp  can  implement  the  Asp/GIu-specific  subset  of  the  N-end  rule  pathway.  (A)  Relative  enzymatic  activities  of  pgal  in  atel^ 
5.  cerevisiae  transformed  with  plasmids  expressing  X-pgal  (as  Ub-X-Pgal)  test  proteins  (X  =  Met,  Arg,  Cys,  Asp,  or  Glu)  together  with  a  plasmid  expressing  either  yeast 
ATEl  (denoted  as  y),  mouse  ATEMp  (denoted  as  1),  mouse  ATEl-2p  (denoted  as  2),  or  the  vector  alone  (denoted  as  v).  The  N-terminal  residues  of  X-pgals  in  each 
set  of  experiments  are  indicated  at  the  top.  The  activity  of  Met-3gal  in  cells  transformed  with  vector  alone  is  taken  as  100%.  (B)  The  two  forms  of  mouse  ATElp  exhibit 
no  cooperativity  in  mediating  the  degradation  of  X-Pgals.  Shown  are  relative  enzymatic  activities  of  X-(5gals  in  atelA  S.  cerevisiae  strain  cotransformed  with  plasmids 
expressing  X-pgals  (Ub-X-3gaIs)  (X  =  Met,  Arg,  Cys,  Asp,  or  Glu)  and  the  combinations  of  plasmids  expressing  the  following  proteins:  mouse  ATEl-lp  and  ATEMp 
(1+1),  ATEl-lp  and  ATEl-2p  (1+2),  ATEl-2p  and  ATEl-2p  (2+2),  or  two  vector  controls  (v+v).  One  of  the  two  vectors  bore  the  TRPl  marker  and  the  other  bore 
the  HISS  marker  (see  Materials  and  Methods).  Results  are  averages  of  four  independent  measurements,  which  differed  by  less  than  10%. 


vectors  alone.  The  results  (Fig.  6B)  indicated  that  the  total 
activities  of  the  1+1  and  1+2  combinations  (measured  as  the 
extent  of  destabilization  of  Asp-3gal  or  Glu-Pgal)  were  similar 
to  each  other  and  much  higher  than  the  total  activities  of  2+2 
(Fig.  6B),  consistent  with  the  conjecture  that  the  two  forms  of 
mouse  R-transferase  do  not  interact  and  that  ATEl-lp  is  a 
more  (possibly  much  more)  active  enzyme  than  ATEl-2p. 

Neither  ATEl-lp  nor  ATEl-2p  confers  metabolic  instability 
on  Cys-pgaL  Cysteine  is  a  stabilizing  residue  in  the  yeast  N-end 
rule  but  a  secondaiy  destabilizing  residue  in  multicellular  or¬ 
ganisms  such  as  mammals  and  amphibians  (14, 18,  30, 54).  The 
presence  of  two  alternative  regions  in  the  two  forms  of  mouse 
R-transferase  (Fig.  4C)  initially  suggested  that  one  R-trans¬ 
ferase  might  be  specific  for  N-terminal  Asp  and  Glu,  with  the 
other  specific  for  Cys.  However,  Cys-Pgal,  which  is  long-lived 
in  wild-type  S.  cerevisiae  (5),  remained  long-lived  in  the  pres¬ 
ence  of  either  ATEl-lp  or  ATEl-2p  (Fig.  6A).  This  finding 
and,  more  directly,  the  results  of  amino  acid  sequencing  (see 
below)  suggest  the  existence  of  a  mammalian  tRNA-depen- 
dent  enzyme  (presumably  a  distinct  R-transferase)  (18)  that 
mediates  destabilizing  activity  of  N-terminal  Cys. 


Mouse  ATEl-lp  and  ATEl-2p  destabilize  Asp-Pgal  and 
Glu-pgal  through  arginylation  of  their  N-terminal  residues. 
To  verify  directly  that  mouse  ATEl-lp  and  ATEl-2p  in  fact 
possess  the  R-transferase  activity,  we  constructed  the  ateliS 
uhrl^  S.  cerevisiae  double  mutant  AVY34,  which  lacked  both 
R-transferase  and  N-recognin  (E3),  the  main  recognition  com¬ 
ponent  of  the  N-end  rule  pathway  (see  Materials  and  Meth¬ 
ods).  Consequently,  N-terminal  arginylation  of  a  test  protein  in 
this  mutant  by  an  exogenous  R-transferase  would  not  result  in 
degradation  of  the  protein,  thereby  making  it  possible  to  iso¬ 
late  enough  of  the  test  protein  for  N-terminal  sequencing. 
Strain  AVY34  was  transformed  with  pUB23-D  (expressing 
Ub-Asp-pgal)  and  also  with  either  pATl  (expressing  ATEl- 
lp),  pAi7  (expressing  ATEl-2p),  or  vector  alone  and  was 
grown  in  SG  medium.  Asp-Pgal  proteins  isolated  from  these 
transformants  were  subjected  to  N-terminal  sequencing  (see 
Materials  and  Methods).  The  results  (Table  1)  directly  con¬ 
firmed  that  both  ATEl-lp  and  ATEl-2p  possessed  R-trans¬ 
ferase  activity.  In  agreement  with  the  finding  that  ATEl-lp 
was  more  active  than  ATEl-2p  in  destabilizing  Asp-pgal  in 
vivo  (Fig.  6A),  Asp-Pgal  from  cells  expressing  ATEl-lp  was 
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TABLE  1.  N-terminal  sequencing  of  X-pgal  proteins  isolated  from 
ateli^  ubrlA  S.  cerevisiae  expressing  different  R-transferases 


Substrate 

Coexpressed 

protein 

N-terminal 

sequence 

Yield  (%) 

D-e*^-(3gal 

D-H-G-S-A- 

D-e*^-|3gaI 

Vector  alone 

D-H-G-S-A- 

'-100 

D-e*^-pgal 

Mouse  ATEl-lp 

R-D-H-G-S-A- 

'-100 

D-e^-(3gal 

Mouse  ATEl-2p 

R-D-H-G-S-A- 

~50 

D-H-G-S-A- 

-50 

C-e^-pgal 

C-H-G-S-A- 

C-e^-pgal 

Vector  alone 

C-H-G-S-A- 

-20 

C-e^-pgal 

Mouse  ATEl-lp 

OH-G-S-A- 

-20 

C-e^-Pgal 

Mouse  ATEl-2p 

C-H-G-S-A- 

-20 

found  to  be  completely  arginylated,  whereas  Asp-pgal  from 
cells  expressing  ATEl-2p  was  arginylated  to  approximately 
50%  (Table  1). 

We  also  determined,  using  the  above  procedure,  whether 
mouse  ATEl-lp  or  ATEl-2p  could  arginylate  N-terminal  Cys. 
Approximately  80%  of  Cys-pgal  isolated  from  atelA  ubrlA  S, 
cerevisiae  was  found  to  be  N-terminally  blocked,  presumably 
acetylated  (Table  1).  However,  the  rest  of  Cys-Pgal  ('-'20%) 
bore  the  N-terminal  sequence  beginning  with  Cys  and  lacking 
N-terminal  Arg,  in  agreement  with  the  results  of  the  in  vivo 
Cys-Pgal  degradation  assays  (Fig.  6  and  Table  1).  Thus,  both 
ATEl-lp  and  Atel-2p  are  apparently  unable  to  utilize  N- 
terminal  Cys  as  a  substrate  in  5.  cerevisiae, 

ATEl-2p  is  exclusively  cytosolic,  whereas  ATEl-lp  is 
present  in  either  the  nucleus  or  the  cytosol.  To  determine  the 
intracellular  location  of  the  two  forms  of  mouse  R-transferase, 


we  constructed  fusions  to  the  N  terminus  of  GFP  and  tran¬ 
siently  expressed  them  in  NIH  3T3  cells.  Whereas  the  free 
26-kDa  GFP  was  located  in  both  the  nucleus  and  the  cytosol 
(data  not  shown),  the  85-kDa  Atel-2p-GFP  fusion  was  exclu¬ 
sively  cytosolic  in  all  of  the  many  transfected  cells  examined 
(Fig.  7a  to  c).  In  contrast,  the  85-kDa  ATEl-lp-GFP  (the 
alternative  form  of  R-transferase  that  is  much  more  active 
enzymatically  than  ATEl-2p)  was  found  to  be  localized  differ¬ 
ently  in  different  cells  on  the  same  coverslip,  possibly  depend¬ 
ing  on  their  cell  cycle  position  and/or  metabolic  state.  Specif¬ 
ically,  in  —50%  of  the  transfected  cells,  ATEl-lp-GFP  was 
exclusively  cytosolic  (Fig.  7d  and  e),  as  was  ATEl-2p-GFP 
(Fig.  7a  to  c),  but  in  the  other  --50%  of  cells,  ATEl-lp-GFP 
was  present  in  the  nucleus  as  well  and,  moreover,  appeared  to 
be  significantly  enriched  in  the  nucleus  (Fig.  7f  and  g).  Thus, 
the  two  43-residue  alternative  regions  in  ATEl-lp  and 
ATEl-2p  confer  overlapping  but  nonidentical  intracellular  dis¬ 
tributions  on  the  respective  R-transferases. 

While  the  nonuniformity  of  the  ATEl-lp-GFP  localization 
among  mouse  cells  in  a  single  culture  remains  to  be  under¬ 
stood,  its  preferential  location  in  the  nuclei  of  some  cells  is 
consistent  with  a  high  content  of  basic  residues  in  its  43-residue 
region,  in  comparison  to  the  alternative  homologous  region  of 
ATEl-2p  (Fig.  2C).  (No  sequences  fitting  the  consensus  se¬ 
quences  of  known  nuclear  localization  signals  could  be  de¬ 
tected  in  the  43-residue  region  of  ATEl-lp).  In  contrast  to 
mouse  R-transferases,  S.  cerevisiae  Ate  Ip  was  shown  to  be 
located  predominantly  in  the  nuclei  of  yeast  cells  (56a). 

The  ratio  of  Atel~l  to  Atel-2  mRNA  varies  greatly  among 
mouse  tissues.  Northern  hybridization,  using  the  1.1-kb  mouse 
Atel  cDNA  fragment  (nt  638  to  1734)  as  a  probe,  detected  a 


FIG.  7.  Intracellular  localization  of  mouse  ATBl-lp  and  ATEl-2p.  Shown  are  green  (GFP)  fluorescence  (a  to  d  and  f)  and  phase -contrast  (e  and  g)  micrographs 
of  mouse  NIH  3T3  cells  transiently  transfected  with  ATEl-lp-GFP  (d  to  g)  or  ATEl-2p-GFP  (a  to  c)  fusion  proteins  (see  Materials  and  Methods).  Panels  a  to  c  show 
different  examples  of  the  exclusively  cytosolic  localization  of  ATEl-2p.  Regions  around  the  nucleus  and  in  the  lamellar  protrusions  at  the  edges  of  a  cell  (c)  exhibit 
higher  GFP  fluorescence,  possibly  because  of  a  greater  thickness  of  cells  in  these  areas.  Panels  d  plus  e  and  f  plus  g  show  pairs  of  GFP  fluorescence  and  phase-contrast 
pictures  of  cells  that  express  ATEl-lp-GFP,  The  cell  in  panels  d  and  e  shows  ATEl-lp-GFP  in  the  cytosol  but  not  in  the  nucleus.  Cells  in  panels  f  and  g  contain 
ATEl-lp-GFP  in  both  the  cytosol  and  the  nucleus,  the  latter  being  apparently  enriched  in  ATEl-lp-GFP. 
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FIG.  8.  Northern  hybridization  analyses  of  mouse  Atel  mRNA.  The  North¬ 
ern  blots  of  mRNA  from  different  mouse  tissues  were  probed  with  dx\Atel  cDNA 
fragment  (nt  638  to  1734)  which  can  hybridize  to  both  forms  of  Atel  mRNA.  A 
mouse  p-actin  cDNA  probe  was  used  for  comparing  the  total  RNA  loads  as 
described  previously  (19).  The  same  blot  was  also  hybridized  with  mouse  Ubrl 
(29)  (see  the  introduction).  The  apparent  absence  of  Atel  and  Ubrl  mRNAs 
from  the  spleen  is  an  artifact  of  RNA  degradation  in  this  lane  of  the  blot  (data 
not  shown).  Also  shown  are  the  results  of  analogous  Northern  hybridizations  of 
the  mouse  Ntanl  cDNA,  encoding  the  Asn-specific  Nt-amidase  (Fig.  lA),  and 
E2i4k,  encoding  the  relevant  Ub-conjugating  (E2)  enzyme  (19).  The  approxi¬ 
mate  sizes  of  transcripts  are  indicated  on  the  right. 


single  ~5.0*kb  transcript  (a  mixture  of  the  Atel-1  and  Atel-2 
mRNAs)  in  all  of  the  mouse  tissues  examined  except  the  testis, 
where  the  ---S-kb  Atel  mRNA  was  a  minor  one,  the  major 
species  being  ~2  kb  (Fig.  8).  Both  ATElp  and  the  other 
targeting  components  of  the  mammalian  N-end  rule  pathway 
are  expressed  ubiquitously  (at  various  levels),  and  the  testis- 
specific  patterns  of  transcripts  are  characteristic  for  all  of  them 
as  well  (Fig.  8).  The  existence  of  the  Y-chromosome-encoded, 
testis-specific  variant  of  the  Ub-activating  (El)  enzyme  (27, 35) 
suggests  that  the  testis-specific  modifications  of  the  N-end  rule 
pathway  may  be  functionally  relevant  in  spermatogenesis. 

To  determine  the  ratio  oiAtel-l  \o  Atel-2  mRNA  in  differ¬ 
ent  mouse  tissues  or  cells  in  culture,  we  employed  RT-PCR, 
using  sequence  differences  between  the  two  alternative,  ho¬ 
mologous  129-bp  exons  to  distinguish  between  them  (see  Ma¬ 
terials  and  Methods)  (Fig.  3).  Approximately  equal  amounts  of 
Ate  1-1  and  Atel-2  ml^As  were  present  in  mouse  embryonic 
fibroblasts  and  in  human  293  cells  in  culture  (Fig.  3  and  data 
not  shown).  However,  the  molar  ratio  of  Atel-1  to  Atel-2 
mRNA  was  found  to  vary  greatly  among  the  mouse  tissues:  it 
was  ~0.1  in  the  skeletal  muscle,  -^0.25  in  the  spleen,  ~3.3  in 
the  liver,  and  brain,  and  —10  in  the  testis  (Fig,  3).  Thus,  while 
the  total  expression  oiAtel  {Atel-1  Atel-2)  varies  by  2-  to 
4-fold  among  mouse  tissues  (Fig,  8),  the  difference  in  expres¬ 
sion  levels  between  and  Atel-2  mRNAs  can  be  as  high 

as  a  100-fold  (the  skeletal  muscle  versus  the  testis)  (Fig.  3), 
suggesting  that  the  two  forms  of  R-transferase  may  be  func¬ 
tionally  distinct. 


DISCUSSION 

The  N-end  rule  pathway  is  one  of  several  proteolytic  path¬ 
ways  of  the  Ub  system  (23,  54,  55).  Among  the  targets  of  the 
N-end  rule  pathway  are  proteins  that  bear  destabilizing  N- 
terminal  residues.  In  the  yeast  S.  cerevisiae,  Asn  and  Gin  are 
tertiary  destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion,  by  a  specific  amidase  (6),  to  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  -encoded  R-transferase,  to  Arg,  one 
of  the  primary  destabilizing  residues  (7)  (Fig.  IB).  In  mam¬ 
mals,  the  set  of  secondary  destabilizing  residues  contains  not 
only  Asp  and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in 
yeast  (18,  54)  (Fig.  1). 

In  this  work,  we  isolated  cDNA  encoding  the  mouse  R- 
transferase,  ATElp,  and  found  that  this  enzyme  exists  in  two 
forms,  termed  ATEl-lp  and  ATEl-2p,  which  differ  by  con¬ 
taining  one  of  the  two  alternative,  homologous  43-residue  re¬ 
gions.  The  two  516-residue  R- transferases  are  produced  from 
the  mouse  Atel  gene  by  a  pathway  of  alternative  splicing  that 
retains  one  or  the  other  of  the  two  homologous  129-bp  exons. 
The  presence  of  two  adjacent,  homologous,  equal-length,  and 
alternatively  utilized  exons  in  a  gene  (Fig.  4)  is  nearly  unprec¬ 
edented.  To  our  knowledge,  just  one  such  case  was  described 
previously:  the  mouse  kE2  enhancer-binding  protein  E12/E47 
(37).  The  two  KE2-binding  proteins,  El 2  and  E47,  are  pro¬ 
duced  through  a  switch  between  two  alternative,  equal-length 
exons,  resulting  in  two  helix-loop-helix  DNA-binding  proteins 
that  differ  in  the  ability  to  homodimerize.  Specifically,  E47  can 
bind  to  the  kE2  enhancer  either  as  a  homodimer  or  as  a 
heterodimer  with  MyoD,  whereas  E12  can  bind  as  a  het¬ 
erodimer  with  MyoD  but  not  as  a  homodimer  (37). 

We  report  the  following  major  findings. 

(i)  Identification,  through  species  walking,  and  isolation  of 
the  mouse  cDNA  encoding  R-transferase  (or  ATElp)  have 
shown  that  mammalian  ATElp  exists  in  two  forms,  ATEl-lp 
and  ATEl-2p,  which  differ  exclusively  by  one  of  the  two  alter¬ 
native,  homologous  43-residue  regions  (Fig.  2A). 

(ii)  The  corresponding  alternative  129-bp  exons  are  adjacent 
in  the  mouse  Atel  gene.  Moreover,  the  12-bp  sequences 
around  the  splice  acceptor  sites  of  these  exons  (6  bp  in  the 
intron  and  6  bp  in  the  exon)  are  identical  between  the  two 
exons  (Fig.  4).  The  splicing  of  Atel  pre-mRNA  proceeds  in 
such  way  that  one,  and  only  one,  of  the  alternative  129-bp 
exons  is  always  retained  in  the  mature  Atel  mRNA. 

(iii)  The  human  gene  also  contains  the  two  alternative 
129-bp  exons,  whereas  the  plant  {A,  thaliana)  and  fly  {D.  mela- 
nogaster)  Atel  genes  encode  a  single  form  of  ATElp  (Fig.  2 
and  5).  The  corresponding  43-residue  regions  are  significantly 
similar  among  all  of  the  sequenced  R-transferases,  from  S, 
cerevisiae  to  mammals  (Fig.  2C).  The  set  of  Atel  genes  from 
mammals  to  yeast  defines  a  distinct  family  of  proteins,  the 
ATE  family.  The  splicing-derived  alternative  forms  of  R-trans¬ 
ferase  have  evolved  apparently  after  the  divergence  of  the 
arthropod  and  vertebrate  lineages. 

(iv)  Expression  of  the  moust  Atel-1  and  Atel-2  cDNAs  in 
atel  A  S.  cerevisiae,  and  N-terminal  sequencing  of  isolated 
X-pgal  test  proteins,  was  used  to  show  that  ATEl-lp  and 
ATEl-2p  could  implement  the  Asp/Glu-specific  subset  of  the 
N-end  rule  pathway  and  that  they  did  so  through  the  argin- 
ylation  of  N-terminal  Asp  or  Glu  in  the  test  substrates  (Fig.  6A 
and  Table  1). 

(v)  While  the  destabilizing  activity  of  the  mouse  ATEl-lp 
R-transferase  is  only  slightly  lower  than  that  of  5.  cerevisiae 
R-transferase,  the  activity  of  mouse  ATEl-2p  is  significantly 
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(possibly  considerably)  lower  than  that  of  ATEl-lp.  This  con¬ 
clusion  follows  also  from  a  comparison  of  the  N-terminal  ar- 
ginylation  of  Asp-Pgal  by  the  two  R-transferases  (Table  1). 
The  results  of  coexpressing  mouse  ATEl-lp  and  ATEl-2p  in 
the  same  atelL  yeast  cells  were  consistent  with  the  conjecture 
that  R-transferase  functions  as  a  monomer  (Fig.  6B), 

(vi)  Neither  ATEl-lp  nor  ATEl-2p  could  confer  instability 
on  (or  arginylate)  Cys-Pgal  in  atelb^  5.  cerevisiae  (Fig.  6 A  and 
Table  1).  Cys  is  a  stabilizing  residue  in  yeast  but  a  secondary 
destabilizing  residue  in  the  mammalian  N-end  rule  (54).  A 
distinct  Cys-specific  mammalian  R-transferase  suggested  by 
these  data  remains  to  be  identified. 

(vii)  Mouse  ATEl-2p  (tested  as  a  GFP  fusion)  was  exclu¬ 
sively  cytosolic  in  mouse  3T3  cells,  whereas  ATEl-lp  was  lo¬ 
calized  differentially  in  different  cells  of  the  same  (unsynchro¬ 
nized)  culture:  it  was  either  exclusively  cytosolic  or  present  in 
both  the  cytosol  and  the  nucleus  (Fig.  7). 

(viii)  Mouse  Atel  is  a  ubiquitously  expressed  gene.  A  single 
~5-kb  mRNA  was  present  in  all  of  the  tissues  examined  except 
the  testis,  where  the  major  transcript  was  ~2  kb  in  length 
(Fig.  8).  The  testis-specific  differential  expression  patterns  are 
also  characteristic  of  the  other  targeting  components  of  the 
mammalian  N-end  rule  pathway,  such  as  the  M^^ni-encoded 
Asn-specific  Nt-amidase  and  tie  L^r7-encoded  N-recognin 
(E3a)  (19,  29). 

(ix)  The  molar  ratio  oiAtel-1  XoAtehl  mRNA  varies  up  to 
a  100-fold  among  different  mouse  tissues  (Fig.  3  and  Results), 
suggesting  a  functional  significance  of  the  difference  between 
the  two  R-transferases. 

The  region  of  ATElp  that  corresponds  to  the  two  129-bp 
mammalian  Atel  exons  has  been  significantly  conserved 
throughout  eukaryotic  evolution,  Tyr-296,  Gln-297,  and  His- 
301  of  the  mouse  ATEl-lp  being  among  the  most  highly  con¬ 
served  residues  (Fig.  2C),  No  putative  members  of  the  ATE 
family  could  be  detected  among  the  currently  known  prokary¬ 
otic  ORFs.  The  most  highly  conserved  region  of  R-transferases 
is  an  82-residue  stretch  (residues  336  to  417)  of  mouse  ATElp: 
this  region  is  95,  76,  and  63%  identical  to  the  corresponding 
regions  of  the  human,  D.  melanogaster,  and  A  thaliana  ATElp, 
respectively  (Fig.  2B).  A  Cys  residue(s)  is  likely  to  be  a  com¬ 
ponent  of  the  active  site  of  R-transferase  (31,  32).  Among  the 
five  fully  conserved  Cys  residues  in  proteins  of  the  ATE  family, 
four  are  located  in  the  56-residue  N-terminal  region  (residues 
23  to  78  of  mouse  ATElp)  (Fig.  2B).  Conversion  of  some  of 
these  cysteines  in  5.  cerevisiae  Atelp  to  alanines  was  found  to 
decrease  greatly  the  R-transferase  activity  of  yeast  Atelp  (16a, 
32).  Furthermore,  derivatives  of  mouse  ATEl-lp  and  ATEl-2p 
that  lacked  the  first  42  residues  were  completely  inactive  in  the 
yeast-based  Asp-pgal  degradation  assay  of  a  kind  described  in 
Fig.  7  (data  not  shown).  Finally,  a  90-residue  C-terminal  tnm- 
cation  of  5.  cerevisiae  Atelp  did  not  result  in  a  major  decrease 
of  its  R-transferase  activity  (16a).  Thus,  the  active  site  of  R- 
transferase  is  likely  to  encompass  at  least  some  of  the  above 
N-terminal  cysteines. 

Since  the  two  mammalian  R-transferases  (Fig.  4C)  are  iden¬ 
tical  in  size  and,  except  for  a  43-residue  region,  are  identical 
otherwise  as  well,  it  is  likely  that  the  previously  described 
(partially  purified)  mammalian  R-transferases  (12,  47)  were  in 
fact  mixtures  of  Atel -Ip  and  Atel-2p.  On  the  other  hand, 
fractionation  of  a  crude  R-transferase  preparation  from  rabbit 
reticulocytes  did  yield,  in  addition  to  a  major  fraction  of  R- 
transferase,  chromatographically  distinct  R-transferase  frac¬ 
tions  that  were  not  investigated  further  (12).  The  ratio  of 
Ate  1-1  p  to  Atel-2p  in  rabbit  reticulocytes  is  currently  un¬ 
known. 

A  splicing-mediated  switch  that  replaces  the  129-bp  Atel 


exon  of  ATEl-lp  with  the  alternative  129-bp  exon  results  in  a 
protein,  ATEl-2p,  that  has  a  significantly  (possibly  consider¬ 
ably)  lower  R-transferase  activity  (Fig.  6).  In  addition, 
ATEl-2p  is  unable  to  enter  the  nucleus  (as  a  GFP  fusion),  in 
contrast  to  ATEl-lp  (Fig.  7).  Taken  together  with  the  finding 
that  the  expression  ratio  of  the  XsNoAtel  mRNASy  Atel -1  and 
Atel-2,  varies  up  to  a  100-fold  among  different  mouse  tissues 
(Fig.  3  and  Results),  these  data  suggest  that  the  two  R-trans¬ 
ferases  are  functionally  distinct  as  well.  Cited  below  are  some 
of  the  possibilities  that  are  consistent  with  the  available  evi¬ 
dence. 

Mouse  ATEl-2p  has  the  R-transferase  activity  but  arginyl- 
ates,  at  steady  state,  only  —50%  of  Asp-^gal  in  atelL  S.  cer¬ 
evisiae,  in  contrast  to  both  ATEl-2p  and  S.  cerevisiae  Atelp 
(Fig.  6  and  Table  1).  Moreover,  the  inefficient  arginylation  by 
ATEl-2p  occurs  in  spite  of  its  overexpression  in  S.  cerevisiae. 
In  contrast,  the  yeast  Atelp,  which  in  wild-type  5.  cerevisiae  is 
a  weakly  expressed  protein  (7),  can  quantitatively  arginylate  in 
vivo  an  overexpressed  substrate  such  as  Asp-Pgal  (18).  Thus,  at 
a  low  level  of  expression  (which  is  likely  to  be  the  case  in  the 
mouse),  the  ATEl-2p  R-transferase  may  be,  in  effect,  an  in¬ 
active  enzyme,  in  contrast  to  ATEl-lp,  If  so,  ATEl-2p  might 
act  as  an  (indirect)  inhibitor  of  the  ATEl-lp  function,  for 
example,  through  a  competition  with  ATEl-lp  for  the  binding 
to  a  component  of  the  targeting  complex  in  the  N-end  rule 
pathway.  (The  apparent  absence  of  such  competition  in  5. 
cerevisiae  [Fig.  6B]  may  result  from  the  lack  of  binding  by 
ATElp  to  heterologous  yeast  proteins.)  It  is  also  possible  that 
a  large  difference  in  activity  between  mouse  ATEl-lp  and 
ATEl-2p  in  yeast  reflects  not  their  different  enzymatic  activi¬ 
ties  in  the  mouse  but  a  (physiologically  irrelevant)  differential 
recognition  of  an  essential  yeast  cofactor  such  as  Arg-tRNA. 
Direct  comparisons  of  arginylation  kinetics  by  the  purified 
mouse  and  yeast  R-transferases  will  be  required  to  address  this 
unlikely  but  unexcluded  interpretation. 

Another  possibility  is  that  ATEl-2p  has  a  distinct  enzymatic 
activity  that  has  been  missed  by  the  current  N-terminal  ar¬ 
ginylation  assay  (Fig.  6  and  Table  1).  For  example,  ATEl-2p 
might  be  able  to  arginylate  an  internal  residue  in  a  substrate 
protein.  In  vitro  enzymological  dissection  of  ATEl-lp  and 
ATEl-2p  will  address  this  and  related  conjectures.  Yet  an¬ 
other  possibility  is  that  the  alternative  43-residue  regions  of 
ATEl-lp  and  ATEl-2p  confer  different  metabolic  stabilities 
on  the  two  R-transferases,  the  lower  apparent  activity  of 
ATEl-2p  in  yeast  being  due  at  least  in  part  to  its  shorter 
half-life.  A  test  of  this  model  in  mouse  cells  requires  antibodies 
specific  for  the  alternative  regions  of  the  two  R-transferases; 
preparation  of  such  antibodies  is  under  way. 

In  yeast,  the  N-end  rule  pathway  is  present  in  both  the 
cytosol  and  the  nucleus.  The  apparent  exclusion  of  mouse 
ATEl-2p  from  the  nucleus  and  the  different  ratios  of  Atel-1  to 
Ate  1-2  mRNA  among  the  mouse  tissues  suggest  that  the  rule 
book  of  the  N-end  rule  pathway  may  be  regulated  differentially 
in  the  cytosol  cuid  the  nucleus,  through  a  cell-type-specific 
expression  of  the  pathway’s  components  that  are  located  in  one 
but  not  the  other  compartment. 

Physiological  substrates  for  either  eukaryotic  R-transferases 
(54)  or  their  prokaryotic  counterparts,  L,  F-transferases  (1,  25, 
45)  are  not  known.  The  cloning  and  characterization  of  the  first 
mammalian  Atel  cDNAs  and  genes  (Fig.  2),  and  the  discov¬ 
ery  of  alternative  splicing  that  yields  mouse  ATEl-lp  and 
ATEl-2p  (Fig.  4)  should  facilitate  understanding  of  the  func¬ 
tions  of  mammalian  R-transferases,  in  part  through  the  anal¬ 
ysis  of  ATEl-lp  and  ATEl-2p  enzymes  and  also  because  it  is 
now  possible  to  construct  mouse  strains  that  lack  ATEl-lp 
and/or  ATEl-2p. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein 
to  the  identity  of  its  N-terminal  residue.  Ubrlp,  the  rec¬ 
ognition  (E3)  component  of  the  Saccharomyces  cerevi- 
siae  N-end  rule  pathway,  contains  at  least  two  substrate¬ 
binding  sites.  The  type  1  site  is  specific  for  N-terminal 
basic  residues  Arg,  Lys,  and  His.  The  type  2  site  is  spe¬ 
cific  for  N-terminal  bulky  hydrophobic  residues  Phe, 
Leu,  Trp,  Tyr,  and  He.  Previous  work  has  shown  that 
dipeptides  bearing  either  type  1  or  type  2  N-terminal 
residues  act  as  weak  but  specific  inhibitors  of  the  N-end 
rule  pathway.  We  took  advantage  of  the  two-site  archi¬ 
tecture  of  Ubrlp  to  explore  the  feasibility  of  bivalent 
N-end  rule  inhibitors,  whose  expected  higher  efficacy 
would  result  from  higher  affinity  of  the  cooperative  (bi¬ 
valent)  binding  to  Ubrlp.  The  inhibitor  comprised 
mixed  tetramers  of  ^-galactosidase  that  bore  both  N- 
terminal  Arg  (type  1  residue)  and  N-terminal  Leu  (type  2 
residue)  but  that  were  resistant  to  proteolysis  in  vivo. 
Expression  of  these  constructs  in  S.  cerevisiae  inhibited 
the  N-end  rule  pathway  much  more  strongly  than  the 
expression  of  otherwise  identical  /3-galactosidase  tet¬ 
ramers  whose  N-terminal  residues  were  exclusively  Arg 
or  exclusively  Leu.  In  addition  to  demonstrating  spatial 
proximity  between  the  type  1  and  t3rpe  2  substrate-bind¬ 
ing  sites  of  Ubrlp,  these  results  provide  a  route  to  high 
affinity  inhibitors  of  the  N-end  rule  pathway. 


Among  the  targets  of  the  N-end  rule  pathway  are  intracel¬ 
lular  proteins  bearing  destabilizing  N-terminal  residues  (1,  2). 
This  proteolytic  pathway  is  one  of  several  pathways  of  the 
ubiquitin  (Ub)^  system,  whose  diverse  functions  include  the 
regulation  of  cell  growth,  division,  differentiation,  and  re¬ 
sponses  to  stress  (3-6).  Uh  is  a  76-residue  eukaryotic  protein 
that  exists  in  cells  either  free  or  conjugated  to  other  proteins. 
Many  of  the  Ub-dependent  regulatory  circuits  involve  proces- 
sive  degradation  of  ubiquitylated  proteins  by  the  26  S  protea- 
some,  an  ATP-dependent  multisubunit  protease  (7,  8). 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast 
Saccharomyces  cerevisiae,  Asn  and  Gin  are  tertiary  destabiliz¬ 
ing  N-terminal  residues  in  that  they  function  through  their 
conversion,  by  the  NTAl-eneoded  N-terminal  amidase,  into  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation  by  the  ATEi-encoded  Arg-tRNA-protein  transfer¬ 
ase  (R-transferase)  to  Arg,  one  of  the  primary  destabilizing 
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residues  (reviewed  in  Refs.  1  and  9).  In  mammals,  two  distinct 
N-terminal  amidases  specific,  respectively,  for  N-terminal  Asn 
or  Gin  mediate  the  conversion  of  these  tertiary  destabilizing 
residues  into  the  secondary  destabilizing  residues  Asp  or  Glu 
(10,  11).  The  set  of  secondary  destabilizing  residues  in  verte¬ 
brates  contains  not  only  Asp  and  Glu  but  also  Cys,  which  is  a 
stabilizing  residue  in  yeast  (9,  12,  13). 

The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  N-recognin,  the  E3  (recognition)  component  of  the 
N-end  rule  pathway.  In  S.  cerevisiae,  N-recognin  is  the  UBRl- 
encoded  225-kDa  protein  that  binds  to  potential  N-end  rule 
substrates  through  their  primary  destabilizing  N-terminal  res¬ 
idues:  Phe,  Leu,  Trp,  T3nr,  lie,  Arg,  Lys,  and  His  (1,  14).  The 
Ubrl  genes  encoding  mouse  and  human  N-recognin  (also  called 
E3a)  have  been  cloned  as  well  (15).  N-recognin  has  at  least  two 
substrate-binding  sites.  The  type  1  site  is  specific  for  the  basic 
N-terminal  residues  Arg,  Lys,  and  His.  The  type  2  site  is 
specific  for  the  bulky  hydrophobic  N-terminal  residues  Phe, 
Leu,  Trp,  Tyr,  and  He  (1, 12, 16, 17),  N-recognin  can  also  target 
short-lived  proteins  such  as  Cup9p  (18)  and  Gpalp  (19,  20), 
which  lack  destabilizing  N-terminal  residues.  The  Ubrlp-rec- 
ognized  degradation  signals  of  these  proteins  remain  to  be 
characterized  in  detail. 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  di-  and  tripeptide  import  in  S.  cerevisiae  through  the 
degradation  of  Cup9p,  a  transcriptional  repressor  of  the  pep¬ 
tide  transporter  gene  PTR2  (18,  21);  a  mechanistically  unde¬ 
fined  role  in  the  Slnlp-dependent  phosphorylation  cascade  that 
mediates  osmoregulation  in  S.  cerevisiae  (22);  the  degradation 
of  Gpalp,  a  Ga  protein  of  S.  cerevisiae  (19,  20);  and  the  condi¬ 
tional  degradation  of  alphaviral  RNA  polymerase  in  virus- 
infected  metazoan  cells  (23).  Physiological  N-end  rule  sub¬ 
strates  were  also  identified  among  the  proteins  secreted  into 
the  cytosol  of  the  host  cell  by  intracellular  parasites  such  as  the 
bacterium  Listeria  monocytogenes  (24).  Short  half-lives  of  these 
proteins  are  required  for  the  efficient  presentation  of  their 
peptides  to  the  immune  system  (24).  A  partial  inhibition  of  the 
N-end  rule  pathway  was  reported  to  interfere  with  mammalian 
cell  differentiation  (25)  and  to  delay  limb  regeneration  in  am¬ 
phibians  (26).  Recent  evidence  suggests  that  the  N-end  rule 
pathway  mediates  a  large  fraction  of  the  muscle  protein  turn¬ 
over  (27)  and  plays  a  role  in  catabolic  states  that  result  in 
muscle  atrophy  (28). 

Targeted  mutagenesis  has  been  used  to  inactivate  the  N-end 
rule  pathway  in  Escherichia  coli  and  S.  cerevisiae  (14,  29). 
Analogous  mutants  have  recently  been  constructed  in  the 
mouse  as  well.^  These  approaches  notwithstanding,  an  effica¬ 
cious  inhibitor  of  the  N-end  rule  pathway  would  be  useful  as 
well,  especially  with  organisms  less  tractable  genetically.  The 
emerging  understanding  of  the  N-end  rule  pathway  in  mam¬ 
mals  suggests  that  selective  inhibition  or  activation  of  this 
proteolytic  system  may  also  have  medical  applications.  Previ- 
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ous  work  has  shown  that  millimolar  concentrations  of  amino 
acid  derivatives  such  as  dipeptides  bearing  destabilizing  N- 
terminal  residues  can  selectively  inhibit  the  N-end  rule  path¬ 
way  in  extracts  from  rabbit  reticulocytes  (12,  17)  and  Xenopus 
eggs  (13),  and  in  intact  S.  cerevisiae  cells  as  well  (16).  However, 
the  same  dipeptides  were  observed  to  have  at  most  marginal 
effects  on  the  N-end  rule  pathway  in  intact  mammalian  cells.^ 
One  limitation  of  dipeptide  inhibitors  is  their  apparently  low 
affinity  for  the  type  1  and  the  type  2  site  of  N-recognin  (30). 

In  the  present  work,  we  explored  the  possibility  that  a  biva¬ 
lent  ligand  can  bind  simultaneously  to  the  type  1  and  type  2 
sites  of  N-recognin  (see  Fig.  lA).  Similarly  to  the  previously 
characterized  bivalent  interactions  that  involve  either  macro¬ 
molecules  or  small  molecules  (31,  32),  the  cooperativity  of  bind¬ 
ing  at  two  independent,  mutually  nonexclusive  sites  would  be 
expected  to  increase  the  affinity  between  N-recognin  and  a 
bivalent  inhibitor  by  orders  of  magnitude,  in  comparison  with 
the  affinity  of  a  monovalent  binding  by  the  same  compound.  We 
show  that  a  bivalent  inhibitor  of  the  N-end  rule  pathway  is 
feasible  and  consider  the  implications  of  this  advance. 

EXPERIMENTAL  PROCEDURES 

Strains  and  General  Techniques — The  S.  cerevisiae  strains  used  were 
JD52  {MATa  ura3-52  his3-i^00  leu2-3,112  trpl-l!^63  lys2-801)  and 
JD55  {MATa  ura3-52  his3-^200  leu2-3,112  trpl-ti63  lys2-801  ubrl 
A:HIS3)  (19,  33).  Cells  were  grown  on  rich  (YPD)  or  synthetic  medium 
containing  either  2%  dextrose  (SD  medium),  2%  galactose  (SG  medi¬ 
um),  or  2%  raffinose  (SR  medium)  (34).  To  induce  the  Pcupi  promoter, 
CUSO4  was  added  to  a  final  concentration  of  0.1  mM.  Transformation  of 
S.  cerevisiae  was  carried  out  using  the  lithium  acetate  method  (35). 

Plasmids — The  high  copy  (2/jL-based)  plasmids  pRA-j3gal-TRPl  and 
pRA-j3gal-HIS3,  which  expressed  Arg-e^^-j3gal  (Ub-Arg-e^^-j8gal)  (see 
Fig.  2A)  from  the  galactose-inducible  Pcyci/gali  hybrid  promoter  (2), 
were  produced  by  replacing  the  URA3  marker  gene  of  pFL7  with  either 
TRPl  or  HIS3.  pLA-j3gal-TRPl  and  pLA-^gal-HIS3,  both  of  which  ex¬ 
pressed  Leu-e^^-jBgal  (Ub-Leu-e^^-/3gal),  were  produced  by  replacing 
the  Ub-Arg  domain  of  pRA-/3gal-TRPl  and  pRA-j3gal-HIS3  with  Ub-Leu 
domain  of  the  pLL2  plasmid.^  The  plasmid  pFL7  was  produced  from 
pUB23-R  (2)  by  converting  the  lysine  codons  15  and  17  of  the  extension 
e^  into  arginine  codons  (36,  37),  yielding  a  construct  encoding  the 
extension  e^^  in  front  of  a  j3gal  moiety  lacking  the  first  23  residues  of 
wild  type  /3gal  (see  Fig.  2A).  The  low  copy,  pRS315  vector-derived  (38) 
plasmid  pR-eAKhaUra3-R3R7  expressed  Arg-e^^-ha-Ura3p^^^’*^'^^  (Ub- 
Arg-e^^-ha-Ura3p^^*^’^'^^)  from  the  Pcupi  promoter.  Arg-e^^^-ha- 
Ura3p*^^^’^^^  (see  Fig.  2B)  is  called  Arg-Ura3p  in  the  main  text.  In  this 
N-end  rule  substrate,  the  residues  Lys-3  and  Lys-7  of  the  S.  cerevisiae 
UraSp  were  converted  to  arginines  (see  “Results  and  Discussion”).  In 
addition,  the  ha  epitope  tag  (39)  was  placed  between  e^^  and 
Ura3p^^^’*^''^  (see  Fig.  2B).  The  plasmid  pR-eAKhaUra3-R3R7  was 
produced  from  pFLl  (encoding  Ub-Arg-e^^-ha-Ura3p)  through  site-di¬ 
rected  mutagenesis  of  the  URA3  codons  for  Lys-3  and  Lys-7.  pFLl  was 
produced  from  pKM1235  (which  encoded  Ub-Arg-e*^-ha-Ura3p)®  by  con¬ 
verting  the  e*^-coding  sequence  into  the  one  encoding  e^^. 

Pulse-Chase  and  Plating  Efficiency  Assays — Pulse-chase  assays  with 
S.  cerevisiae  in  mid-exponential  growth  (Agoo  of  ^^1)  utilized  ^^S-EX- 
PRESS  (NEN  Life  Science  Products)  and  were  carried  out  as  described 
previously  (10,  19),  including  the  immunoprecipitation  with  anti-)3gal 
and  anti-ha  antibodies  and  quantitation  with  a  Phosphorlmager  (Mo¬ 
lecular  Dynamics,  Sunn3rvale,  CA).  To  determine  plating  efficiency,  S. 
cerevisiae  strains  JD52  {UBRl)  and  JD55  {ubrl A)  expressing  Arg- 
Ura3p  (lJb-Arg-e^^-ha-Ura3p^^^’*^^^;  see  Fig.  2B)  were  co-transformed 
with  plasmids  indicated  in  the  legend  to  Fig.  3.  The  transformants  were 
cultured  in  the  raffinose-based  medium  (SR)  lacking  Leu,  His,  and  Trp 
for  20  h.  The  cultures  were  then  diluted  into  the  otherwise  identical 
galactose-containing  (SG)  medium  to  a  final  Ag^o  of  0.1.  At  an  Agoo  of 
0.4,  cultures  were  either  supplemented  with  0.1  mM  CUSO4  or  left 
unsupplemented.  At  the  Agoo  of  1.0,  the  cultures  were  diluted  with  SG 
(which  lacks  Leu,  His,  and  Trp)  either  containing  or  lacking  0.1  mM 
CUSO4  and  were  plated  on  the  plates  of  the  same  medium  composition 
that  also  either  contained  or  lacked  uracil.  The  plating  efficiency  (%) 


^  F.  Levy  and  A.  Varshavsky,  unpublished  data. 

^  M.  Ghislain  and  A.  Varshavsky,  unpublished  data. 
^  K.  Madura  and  A.  Varshavsky,  unpublished  data. 


Bivalent  Inhibitor  Test  substrate 


Fig.  1.  The  concept  of  a  bivalent  inhibitor  of  the  N-end  rule 
pathway.  A,  the  t3q)e  1  and  type  2  sites  of  S.  cerevisiae  Ubrlp  (N- 
recognin),  which  are  specific,  respectively,  for  the  basic  (Arg,  Lys,  and 
His)  and  bulky  hydrophobic  (Phe,  Leu,  Trp,  Tyr,  and  lie)  N-terminal 
residues.  In  the  diagram,  the  type  1  and  type  2  sites  are  occupied  by 
their  ligands,  the  N-terminal  Arg  and  Leu,  borne  by  a  heterodimeric 
bivalent  inhibitor  (actually,  a  tetrameric  /3gal-based  protein  in  the 
present  work).  A  test  substrate  bearing  Arg,  a  type  1  destabilizing 
N-terminal  residue  is  shown  as  well.  The  test  substrate,  in  contrast  to 
the  protein-based  inhibitor,  bears  at  least  one  internal  Lys  residue  (not 
indicated  in  the  diagram)  that  can  function  as  a  component  of  the 
N-degron.  The  t3rpe  1  and  t3q)e  2  sites  of  N-recognin  are  shown  located 
close  together  in  the  N-terminal  region  of  the  225-kDa  Ubrlp.  The 
recent  genetic  dissection  of  the  Ubrlp  substrate-binding  sites®  placed 
the  type  1  and  t5rpe  2  sites  close  together  in  the  ~60-kDa  N-terminal 
region  of  the  225-kDa  Ubrlp.  B,  a  diagram  illustrating  the  expected 
frequencies  of  heterodimeric  (Arg-  and  Leu-bearing)  dimers  within  a 
^gal-based  bivalent  inhibitor.  Specifically,  at  equal  levels  of  expression 
of  the  two  jSgal-based  polypeptide  chains,  50%  of  jSgal  tetramers  would 
be  expected  to  be  heterotetramers  in  which  at  least  one  of  the  two 
dimers  bears  different  (Arg  and  Leu)  N-terminal  residues.  In  the  jSgal 
tetramer,  the  two  N  termini  of  each  dimer  are  spatially  close,  exposed, 
and  oriented  in  the  same  direction  (40).  See  also  “Results  and 
Discussion.” 


was  defined  as  the  ratio  of  the  number  of  colonies  on  SG  (-Leu,  -His, 
—Trp,  — Ura)  plates  to  the  number  of  colonies  on  SG  (—Leu,  —His, 
-Trp)  plates,  at  the  same  concentration  of  CUSO4.  For  each  measure¬ 
ment,  colonies  on  15  plates  were  counted  to  yield  the  average  number  of 
colonies  per  plate. 


RESULTS  AND  DISCUSSION 

We  constructed  a  bivalent  N-end  rule  inhibitor  (Fig.  lA)  from 
the  previously  studied  N-end  rule  substrates  derived  from  E. 
coli  jSgal  (2).  In  eukaryotes,  linear  Ub-protein  fusions  are  rap¬ 
idly  cleaved  by  deubiquitylating  enzymes  at  the  Ub-protein 
junction,  making  possible  the  production  of  otherwise  identical 
proteins  bearing  different  N-terminal  residues,  a  technical  ad¬ 
vance  that  led  to  the  finding  of  the  N-end  rule  (2).  A  |3gal-based 
N-end  rule  substrate  contains  a  destabilizing  N-terminal  resi¬ 
due  (produced  in  vivo  using  the  Ub  fusion  technique  (1));  a 
~45-residue,  E.  coli  Lac  repressor-derived  N-terminal  exten¬ 
sion  called  e^  (extension  e  bearing  lysines  K);  and  the  /3gal 
moiety  lacking  its  first  21  residues.  The  resulting  X-e^-j3gal  is 
a  short-lived  protein  in  both  yeast  and  mammalian  cells, 
whereas  an  otherwise  identical  protein  bearing  a  stabilizing 
N-terminal  residue  such  as  Met  or  Val  is  metabolically  stable 
(1,  2).  An  N-degron  comprises  a  destabilizing  N-terminal  resi¬ 
due  and  a  Lys  residue  (or  residues),  the  latter  being  the  site  of 
formation  of  a  multi-Ub  chain  (1,  36).  (Ubrlp  can  also  recognize 
a  set  of  other,  internal  degrons,  which  remain  to  be  character¬ 
ized  (18).)  If  Lys-15  and  Lys-17  of  the  e^  extension  are  replaced 
by  the  Arg  residues  (which  cannot  be  ubiquitylated),  the  result¬ 
ing  X-e^^-/3gal  (Fig.  2A)  is  long-lived  in  vivo  even  if  its  N- 
terminal  residue  is  destabilizing  in  the  N-end  rule  (1,  37). 

In  the  present  work,  we  used  the  metabolically  stable  Arg- 
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Fig.  2.  Designs  of  bivalent  inhibitor  and  test  substrate.  A,  the 

/3gal -based  fusions  (the  residue  X  was  either  Arg  or  Leu)  used  to 
construct  the  Arg/Leu-bearing  bivalent  inhibitor.  The  Ub  moiety  of  the 
fusions  was  cotranslationally  removed  in  vivo  by  deubiquitinating  en¬ 
zymes  (1).  The  ~45-residue,  E.  coli  Lac  repressor-derived  sequence 
termed  e^*^  (extension  (e)  lacking  lysines  (A/D),  is  described  in  the  main 
text.  The  j3gal  part  of  the  fusion  lacked  the  first  21  residues  of  wild  type 
/3gal  (2).  By  the  UraSp-based  N-end  rule  substrate,  Arg-e^^-ha- 
UraSp^^^’^"^^,  derived  from  Ub-Arg-e^^-ha-UraSp*^^^’^’^’^  and  denoted 
Arg-Ura3p,  is  described  in  the  main  text. 


e^*^-/3gal  (produced  from  Ub-Arg-e^^-/3gal)  and  Leu-e^^-j3gal 
(produced  from  Ub-Leu-e^^-j8gal).  These  proteins  retain  the 
ability  to  bind,  respectively,  to  the  type  1  and  t57pe  2  sites  of 
N-recognin  but  cannot  be  ubiquitylated  (37),  apparently  be¬ 
cause  the  most  N-terminal  Lys  residue  in  X-e^^-j3gal,  at  posi¬ 
tion  239,  is  too  far  from  the  N  terminus  of  the  protein.  In  the 
/3gal  tetramer,  the  two  N  termini  of  each  dimer  are  spatially 
close,  exposed,  and  oriented  in  the  same  direction  (40).  At  equal 
levels  of  expression  of  the  two  )3gal-based  polypeptide  chains 
such  as  Arg-e^^-j3gal  and  Leu-e^^-/3gal,  50%  of  /3gal  tetramers 
would  be  expected  to  be  heterotetramers  in  which  at  least  one 
of  the  two  dimers  bears  different  (Arg  and  Leu)  N-terminal 
residues  (Fig.  LB).  If  the  type  1  and  type  2  substrate-binding 
sites  of  the  225-kDa  Ubrlp  are  appropriately  located  and  ori¬ 
ented,  they  might  be  able  to  bind  the  Arg-  and  Leu-bearing 
subunits  of  the  mixed  j3gal  tetramer,  especially  in  view  of  the 
presumed  flexibility  of  the  e^^  extension  (1)  (Fig.  L4). 

The  reporter  N-end  rule  substrate  in  this  study  was  Arg-e^^- 
ha-Ura3p®^^®^’*^^*^,  denoted  below  as  Arg-Ura3p  (Fig.  2B).  This 
ha-tagged,  type  1  N-end  rule  substrate  was  produced  from 
Ub-Arg-e^^-ha-Ura3p*^^^’*^^*^  through  the  cotranslational  in 
vivo  cleavage  by  deubiquitinating  enzymes  (1,  6,  41).  The  ly¬ 
sine-lacking  e^*^  extension  of  Arg-e^^-ha-Ura3p^^^’^^^,  and 
the  replacement  of  the  first  two  lysines  of  the  Ura3p  moiety 
with  arginines  were  used  to  decrease  the  rate  of  degradation  of 
Arg-Ura3p  by  the  N-end  rule  pathway  and  also  to  reduce  the 
slow  but  detectable  degradation  of  Arg-Ura3p  by  yet  another 
pathway,  through  a  degron  distinct  from  the  N-degron.^  Sev¬ 
eral  Lys  residues  of  UraSp  other  than  Lys-3  and  Lys-7  are  also 
close  to  its  N  terminus,  thus  accounting  for  the  absence,  in  this 
case,  of  the  all-or-none  effect  on  the  reporter  degradation  that 
is  observed  when  e^  is  replaced  with  e^^  in  an  X-e*^-j3gal 
substrate  (37).  The  Lys-3  ^  Arg  and  Lys-7  — ^  Arg  modifications 
decreased  the  enzymatic  activity  of  the  Ura3p  moiety.^  The 
reduced  enz3Tnatic  activity  of  Ura3p*^^^’^^^  facilitated  selec¬ 
tion  assays  (Figs.  3  and  4). 

The  first  bivalent  inhibitor  assay  employed  ura3  S.  cerevisiae 
expressing  Arg-Ura3p  (Fig.  2B)  from  the  uninduced  Pcupi  Pro¬ 
moter.  The  Ubrlp-mediated  degradation  of  Arg-Ura3p  of 
~8  min)  and  its  correspondingly  low  steady-state  concentration 
rendered  wild  type  (UBRl)  cells  phenotj^pically  Ura“,  whereas 
ubrlA  strains  expressing  Arg-Ura3p  were  phenotypically  Ura"^ 
(Figs.  3  and  4  and  data  not  shown).  Cells  expressing  Arg-Ura3p 
were  cotransformed  with  two  control  plasmids  (vectors; 


SG(-Leu,-ffis,-Trp,-Ura) 

C+Cu-^) 


SG(-Leu,-His,-Ttp,-Ura) 


SG(-Leu,-His,-Trp) 

(-Cu*^^) 


Fig.  3.  A  bivalent,  but  not  monovalent,  inhibitor  of  Ubrlp  con¬ 
fers  Ura"^  phenotype  on  cells  expressing  the  short-lived  N-end 
rule  substrate  Arg-Ura3p.  S.  cerevisiae  JD52  {UBRl)  expressing 
Arg-Ura3p  (Ub-Arg-Ura3p),  a  short-lived  reporter  (Fig.  2B),  were  co¬ 
transformed,  alternatively,  with  pRS423  (i/7S3-based  control  vector) 
and  pRS424  (Pgalu  TBPi -based  control  vector)  (denoted  as  423+424); 
with  pRS424  and  pRA-j3gal-HIS3,  expressing  Arg-e^^-/3gal  (Ub-Arg- 
e'^^-pgal)  (denoted  as  R+424;  monovalent  inhibitor);  with  pRS423  and 
pLA-jSgal-TRPl,  expressing  Leu-e^^-j3gal  (Ub-Leu-e^^-/3gal)  (denoted 
as  423  +L);  with  pRA-/3gal-HIS3  and  pRA-j3gal-TRPl,  both  expressing 
Arg-e^^-/3gal  (denoted  as  R+R;  monovalent  inhibitor);  with  pLA-j3gal- 
HIS3  and  pLA-pgal-TRPl,  both  expressing  Leu-e^^-j3gal  (denoted  as 
L+L);  or  with  pRA-j3gal-HIS3  and  pLA-j3gal-TRPl,  expressing  Arg-e^^- 
/3gal  and  Leu-e^*^-j3gal  (denoted  as  R+L;  the  bivalent  inhibitor).  Cells 
were  streaked  on  SG  medium  containing  0.1  mM  CUSO4  and  lacking 
Leu,  His,  Trp,  and  Ura  {upper  left  paneDy  on  the  otherwise  identical 
medium  lacking  the  added  CUSO4  {upper  right  paneDy  or  on  the  Ura- 
containing  SG  medium  lacking  Leu,  His,  and  Trp  (controls;  lower  left 
panel)-  Plates  were  incubated  at  30  “C  for  3  days. 


423  +424  in  Fig.  3).  Alternatively,  these  cells  were  cotrans¬ 
formed  with  two  plasmids  (bearing  different  selectable  mark¬ 
ers)  that  expressed  either  Arg-e^*^-j3gal  alone  {R+R  in  Fig.  3), 
Leu-e^^-)3gal  alone  (L-f-L  in  Fig.  3),  or  both  of  them  together 
{R+L  in  Fig.  3;  the  bivalent  inhibitor  mode)  from  a  galactose- 
inducible  promoter.  Pairs  of  alternatively  marked  plasmids 
were  used  to  make  certain  that  the  conditions  of  expression  and 
the  total  amounts  of  jSgal-based  proteins  produced  remained 
the  same  in  all  of  these  settings.  The  transformants  were 
streaked  on  SG  medium  lacking  uracil. 

Remarkably,  only  those  Arg-Ura3p-expressing  cells  that  ex¬ 
pressed  both  Arg-e^^-j3gal  and  Leu-e^^-)3gal  became  Ura^  un¬ 
der  these  conditions  (Fig.  3).  The  cells  that  expressed  either 
Arg-e^^-jSgal  alone  or  Leu-e^®^-j3gal  alone  remained  Ura“,  as 
did  the  cells  that  received  control  plasmids  (Fig.  3).  (The  same 
cells  grew  equally  well  in  the  control  SG  medium  containing 
uracil  (Fig.  3,  bottom  left  panel).)  Note  that  the  monovalent 
inhibitors  were  ineffective  despite  the  fact  that  the  concentra¬ 
tion  of  either  the  Arg-based  N  terminus  alone  or  the  Leu-based 
N  terminus  alone  was  twice  the  concentration  of  the  same  N 
termini  in  the  case  of  the  bivalent  inhibitor. 

To  quantify  the  effect  of  coexpressing  Arg-e^^-j8gal  and  Leu- 
e^^-jSgal  on  the  rescue  of  the  Ura^  phenotype,  a  plating  effi¬ 
ciency  assay  was  carried  out  with  the  same  transformants. 
Equal  amounts  of  cells  were  plated  on  SG(+Ura)  and 
SG(— Ura)  plates,  and  the  numbers  of  colonies  were  deter¬ 
mined.  When  the  Arg-Ura3p  reporter  was  expressed  at  a  suf¬ 
ficiently  low  rate  (uninduced  Pcupi  promoter),  cells  became 


18138 


Bivalent  Inhibitor 


Fig.  4.  Plating  efficiencies  of  S.  cerevisiae  expressing  Arg- 
UraSp  in  the  presence  of  bivalent  and  monovalent  inhibitors  of 
the  N-end  rule  pathway.  A,  S.  cerevisiae  JD52  (UBRl)  and  JD55 
(ubrlA)  expressing  Arg-Ura3p  were  cotransformed  with  the  sets  of 
plasmids  described  and  denoted  in  the  legend  to  Fig.  3.  The  transfor¬ 
mants  were  cultured  as  described  under  “Experimental  Procedures” 
and  plated  on  either  SG(-Leu,  -His,  -Trp,  -Ura)  plates  or  control 
plates  SG(-Leu,  -His,  -Trp)  containing  0.1  mM  CUSO4.  The  plating 
efficiencies  shown  are  the  values  produced  by  normalization  against  the 
absolute  plating  efficiency  (92%)  of  the  positive  control:  the  u6riA 
strain  JD55  expressing  Arg-Ura3p  and  bearing  the  vector  pRS424.  S, 
the  same  experiment  was  done  using  plates  lacking  the  added  CUSO4. 
The  plating  efficiencies  shown  are  the  values  produced  by  normaliza¬ 
tion  against  the  positive  control  used  in  A.  Under  these  growth  condi¬ 
tions  (no  added  CUSO4),  the  absolute  plating  efficiency  of  the  positive 
control  was  26%. 


Ura"^  (through  metabolic  stabilization  of  Arg-Ura3p)  only  in 
the  presence  of  both  Arg-e^^-)3gal  and  Leu-e'^^-jSgal  (Fig.  4B).  A 
weak  stabilizing  effect  of  Arg-e^^-/3gal  alone  could  be  detected 
only  at  a  ~ 20-fold  higher  level  of  Arg-Ura3p  expression  (in¬ 
duced  Pcupi  promoter)  (Fig.  4A).  No  stabilization  of  Arg-Ura3p 
was  observed  in  the  presence  of  Leu-e^^-jSgal  under  any  con¬ 
ditions  (Fig.  4),  confirming  the  specificity  of  inhibition  in  regard 
to  the  type  (basic  or  bulky  hydrophobic)  of  the  primary  desta¬ 
bilizing  N-terminal  residue  of  the  reporter.  Higher  sensitivity 
of  this  assay  at  the  higher  level  of  Arg-Ura3p  expression  results 
from  a  higher  steady-state  level  of  the  short-lived  Arg-Ura3p, 
so  that  even  its  marginal  stabilization  suffices  to  render  a  small 
fraction  of  cells  Ura'^  (Fig.  4A;  compare  with  Fig.  4B). 

To  analyze  directly  the  in  vivo  degradation  of  Arg-Ura3p  in 
the  presence  of  different  combinations  of  X-e^^-j3gal  proteins, 
the  transformants  of  Figs.  3  and  4  were  subjected  to  pulse- 
chase  analysis,  with  immunoprecipitation  of  both  Arg-Ura3p 
and  the  (long-lived)  X-e^^-jSgals  (Fig.  5).  Quantitation  of  the 
resulting  electrophoretic  patterns  (Fig.  5C)  confirmed  and  ex¬ 
tended  the  conclusions  reached  through  phenotypic  analyses 
(Figs.  3  and  4).  Specifically,  the  normally  short-lived  Arg- 
Ura3p  (Fig.  5A,  lanes  1-3)  was  strongly  (but  still  incompletely) 
stabilized  in  the  presence  of  both  Arg-e^*^-j3gal  and  Leu-e^^- 
j8gal  (Fig.  5A,  lanes  4-6;  compare  with  lanes  1-3  and  7-9).  This 
stabilization  was  manifested  especially  clearly  as  an  increase 
in  the  relative  amount  of  Arg-Ura3p  at  the  beginning  of  chase 
(time  0),  indicating  reduced  degradation  of  Arg-Ura3p  during 
the  pulse  (Fig.  5C).  This  latter  degradation  pattern,  termed 
“zero  point  effect,”  is  caused  by  the  previously  demonstrated 
preferential  targeting  of  newly  formed  (as  distinguished  from 
conformationally  mature)  protein  substrates  by  the  N-end  rule 
pathway  (16,  42).  The  increased  steady-state  level  of  Arg- 
Ura3p  in  the  presence  of  both  Arg-e^*^-/3gal  and  Leu-e^^-j3gal 
accounted  for  the  results  of  phenotypic  analyses  (Figs.  3  and  4). 
The  much  smaller  but  detectable  stabilization  of  Arg-Ura3p  by 
Arg-e^^-j3gal  alone  (Fig.  5C)  was  consistent  not  only  with  the 
inability  of  Arg-e^^-j3gal  to  confer  the  Ura"^  phenotype  on  cells 
expressing  Arg-Ura3p  from  uninduced  Pcupi  promoter  but  also 
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Fig.  5.  Metabolic  stabilization  of  Arg-Ura3p  in  the  presence  of 
bivalent  N-end  rule  inhibitor.  A,  S.  cerevisiae  JD52  {UBRl)  express¬ 
ing  Arg-Ura3p,  a  short-lived  Ura3p-based  reporter  (Fig.  2B),  from  the 
induced  Pcupi  promoter,  were  cotransformed,  alternatively,  with  either 
pRS423  (H7iS3-based  control  vector)  and  pRS424  (Pgalu  TRPi -based 
control  vector)  (denoted  as  423+424),  or  with  pRA-j3gal-TRPl  and  pLA- 
j8gal-HIS3,  expressing  Arg-e^^-^gal  and  Leu-e^^-/3gal  (denoted  as  R+L; 
the  bivalent  inhibitor).  Control  JD55  (ubrlA)  cells  expressing  Arg- 
Ura3p  were  transformed  with  pRS424.  Cells  grown  in  either  dextrose- 
containing  SD  medium  (no  expression  of  ^gal)  or  galactose-containing 
SG  medium  were  labeled  with  [^^S]  methionine/cysteine  for  5  min  at 
30  ®C,  followed  by  a  chase  for  0,  10,  and  30  min,  extraction,  immuno¬ 
precipitation,  and  SDS-10%  polyacrylamide  gel  electrophoresis.  B, 
same  as  in  A,  but  cells  were  also  cotransformed  with  the  plasmids 
pRA-j3gal-TRPl  and  pRA-^gal-HIS3,  both  expressing  Arg-eAK-/3gal  (de¬ 
noted  as  R+R).  The  assays  were  carried  out  in  SG  medium.  C,  in  vivo 
decay  curves  of  Arg-Ura3p  (Fig.  2B)  in  wild  type  (JD52)  and  ubrlA 
(JD55)  cells.  The  patterns  in  A  and  B  were  quantified  as  described 
under  “Experimental  Procedures.”  The  initial  amounts  of  Arg-Ura3p 
were  normalized  against  the  amount  in  ubrlA  cells  (100%).  Note  that 
the  inhibitors  also  altered  the  zero  point  effect  (degradation  of  a  re¬ 
porter  during  the  pulse  (16,  42)).  •,  degradation  of  Arg-Ura3p  in  UBRl 
(JD52)  cells  in  the  presence  of  both  Arg-e^^-j3gal  and  Leu-e^^-j3gal  (cells 
were  grown  in  SG  medium);  O,  the  same  transformants  were  grown  in 
SD  medium  where  jSgal  fusions  were  not  expressed;  ▼,  JD52  cells 
expressing  Arg-Ura3p  were  transformed  with  the  two  alternatively 
marked  plasmids  expressing  Arg-e^^-/3gal  and  grown  in  SG  medium;  □, 
JD52  cells  expressing  Arg-Ura3p  were  transformed  with  the  two  alter¬ 
natively  marked  control  vectors  and  grown  in  SG  medium;  ■,  ubrlA 
(JD55)  cells  expressing  Arg-Ura3p  were  transformed  with  control  vec¬ 
tors  and  grown  in  SG  medium. 


with  the  partial  rescue  of  the  Ura'^  phenotype  by  Arg-e^^-/3gal 
in  cells  expressing  Arg-Ura3p  from  the  induced  Pcupi  (Figs.  3 
and  4  and  data  not  shown). 
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The  Arg/Leu-e'^^-iSgal-based  bivalent  inhibitor  of  the  present 
work,  although  surprisingly  potent  (Fig.  4B),  is  obviously  far 
from  optimal  even  for  a  protein-based  inhibitor;  because  jSgal  is 
a  homotetramer,  only  —50%  of  the  coexpressed  Arg-e^^-)3gal 
and  Leu-e'^^^-Pgal  chains  would  exist  as  heterodimers  within 
tetramers  (Fig.  IB).  (This  estimate  assumes  a  random  assort¬ 
ment  of  Arg-  and  Leu-bearing  )3gal  chains  in  the  formation  of 
/3gal  tetramers.  The  actual  in  vivo  assortment  is  expected  to  be 
biased,  to  an  unknown  extent,  in  favor  of  homodimeric  associ¬ 
ations,  because  individual  polysomes  would  produce  jSgal 
chains  bearing  either  Arg  or  Leu  but  not  both.)  In  addition, 
although  the  e^^  extension  (Fig.  2A)  is  capable  of  supporting 
the  desired  effects,  it  is  also  unlikely  to  be  optimal.  In  sum¬ 
mary,  the  efficacy  of  this  first  and  necessarily  suboptimal  bi¬ 
valent  inhibitor  bodes  well  for  the  future  of  this  design. 

A  bivalent  inhibitor  is  strikingly  more  efficacious  than  an 
otherwise  identical  monovalent  inhibitor  (Figs.  3-5).  In  addi¬ 
tion,  our  findings  are  the  first  evidence  that  the  type  1  and  type 
2  sites  of  N-recognin  are  spatially  proximal  in  the  225-kDa  S. 
cerevisiae  Ubrlp.  While  this  work  was  under  way,  genetic 
dissection  of  S.  cerevisiae  Ubrlp  identified  amino  acid  residues 
that  are  required  for  the  integrity  of  the  type  1  site  but  not  the 
type  2  site,  and  vice  versa. ^  These  results  provided  independent 
evidence  for  both  the  separateness  and  spatial  proximity  of  the 
two  substrate-binding  sites  of  the  225-kDa  N-recognin,  in 
agreement  with  the  present  data.  Our  results  (Figs.  3—5) 
strongly  suggest  that  small  bivalent  inhibitors  of  the  N-end 
rule  pathway  are  feasible,  and  moreover,  are  expected  to  be 
much  more  potent  than  their  monovalent  counterparts.  Work 
to  produce  such  inhibitors  is  under  way. 
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Analysis  of  a  conditional  degradation  signal  in  yeast  and  mammalian  cells 

Frederic  Levy*,  Jennifer  A.  Johnstont  and  Alexander  Varshavsky 

Division  of  Biology,  California  Institute  of  Technology,  Pasadena,  CA,  USA 

The  N-end  rule  pathway  is  a  ubiquitin-dependent  proteolytic  system,  the  targets  of  which  include  proteins  that  bear 
destabilizing  N>terminal  residues.  The  latter  are  a  part  of  the  degradation  signal  called  the  N-degron.  Arg-DHFR'\ 
an  engineered  N-end  rule  substrate,  bears  N-terminal  arginine  (a  destabilizing  residue)  and  DHFR^"'  [a  temperature-  . 
sensitive  mouse  dihydrofolate  reductase  (DHFR)  moiety].  Previous  work  has  shown  that  Arg-DHFR'"  is  long-lived 
at  23  °C  but  short-lived  at  37  °C  in  the  yeast  Saccharomyces  cerevisiae.  In  the  present  work,  we  extended  this 
analysis,  and  found  that  the  degradation  of  Arg-DHFR'’'  can  be  nearly  completely  inhibited  in  vivo  by  methotrexate 
(MTX),  a  low-A/r  ligand  of  DHFR.  In  S.  cerevisiae.  Arg-DHPR*""  is  degraded  at  37  °C  exclusively  by  the  N-end  rule 
pathway,  whereas  in  mouse  cells  the  same  protein  at  the  same  temperature  is  also  targeted  by  another  proteolytic 
system,  through  a  degron  in  the  conformationally  perturbed  DHFR'’'  moiety.  In  mouse  cells,  MTX  completely 
inhibits  the  degradation  of  Arg-DHFR'"  through  its  degron  within  the  DHFR'’'  moiety,  but  only  partially  inhibits 
degradation  through  the  N-degron.  When  the  N-terminus  of  Arg-DHFR'’'  was  extended  with  a  42-residue  lysine- 
lacking  extension,  termed  the  resulting  Arg-e'^^-DHFR'’'  was  rapidly  degraded  at  both  23  '"C  and  37  ®C. 
Moreover,  the  degradation  of  Arg-e'^‘^-DHFR'\  in  contrast  with  that  of  Arg-DHFR'\  could  not  be  inhibited  by  MTX. 
suggesting  that  the  metabolic  stability  of  Arg-DHFR'’'  at  23  °C  results,  at  least  in  part,  from  steric  inaccessibility  of 
its  N-terminal  arginine.  The  N-degron  of  Arg-DHFR'"'  is  the  first  example  of  a  portable  degradation  signal  the 
activity  of  which  can  be  modulated  in  vivo  by  a  cell-penetrating  compound.  We  discuss  implications  of  this  advance 
and  the  mechanics  of  targeting  by  the  ubiquitin  system. 

Keywords:  degron;  methotrexate:  N-end  rule;  proteolysis;  ubiquitin. 


A  number  of  regulatory  circuits,  including  those  that  control  the 
cell  cycle,  cell  differentiation  and  responses  to  stress,  involve 
metabolically  unstable  proteins  [1-6].  A  short  in  vivo  half-life 
of  a  regulator  provides  a  way  to  generate  its  spatial  gradient  and 
allows  rapid  adjustments  of  its  concentration,  or  subunit 
composition,  through  changes  in  the  rate  of  its  synthesis.  A 
protein  can  also  be  conditionally  unstable,  i.e.  long-lived  or 
short-lived,  depending  on  the  state  of  the  cell.  One  example  of 
the  latter  class  are  cyclins,  a  family  of  related  proteins  the 
destruction  of  which  at  specific  stages  of  the  cell  cycle  regulates 
cell  division  and  growth  [3,7], 

Features  of  proteins  that  confer  metabolic  instability  are  called 
degradation  signals,  or  degrons  [8].  An  essential  component  of 
one  degradation  signal,  called  the  N-degron.  is  a  destabilizing 
N-terminal  residue  of  a  protein  [9,10].  A  set  of  N-degrons 
containing  different  N-terminal  residues  which  are  destabilizing 
in  a  given  cell  yields  a  rule,  termed  the  N-end  rule,  which  relates 
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the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-ierminal 
residue.  In  eukaryotes,  the  N-end  rule  pathway  is  a  part  of  the 
ubiquitin  (Ub)  system.  Ub  is  a  76-residue  protein  whose 
covalent  conjugation  to  other  proteins  plays  a  role  in  a  multitude 
of  processes,  including  cell  growth,  differentiation  and  responses 
to  stress  [2,4,1 1-13].  In  many  of  these  settings,  Ub  acts  through 
routes  that  involve  the  degradation  of  Ub-protein  conjugates  by 
the  26S  proteasome,  an  ATP-dependent  multisubunit  protease 
[14,15]. 

In  eukaryotes,  linear  Ub-protein  fusions  are  rapidly  cleaved 
by  Ub-specific  proteases  (UBPs)  at  the  Ub-protein  junction, 
making  possible  the  production  of  otherwise  identical  proteins 
bearing  different  N-terminal  residues,  a  technical  advance  that 
led  to  the  discovery  of  the  N-end  rule  [9,10].  When  mouse 
dihydrofolate  reductase  (DHFR),  a  20-kDa  monomeric  protein, 
was  expressed  as  a  Ub-Arg-DHFR  fusion  in  the  yeast  Sac¬ 
charomyces  cerevisiae  at  30  ®C,  the  resulting  Arg-DHFR  (pro¬ 
duced  through  deubiquity lation)  was  long-lived  >  4  h), 
even  though  it  bore  N-terminal  Arg,  a  destabilizing  residue  [16]. 
By  contrast,  a  modified  protein.  Arg-e*^-DHFR  (produced  from 
Ub-Arg-e*^-DHFR),  which  bore  a  42-residue,  lysine-containing 
extension  (denoted  as  e*^)  between  the  N-terminal  Arg  residue 
and  the  first  (Val)  residue  of  DHFR,  was  short-lived  in  vivo  (/0.5 
of  10  min),  being  degraded  by  the  N-end  rule  pathway  [16]. 

This  and  other  evidence  indicated  that  the  N-degron  com¬ 
prises  two  essential  determinants:  a  destabilizing  N-terminal 
residue  and  an  internal  Lys  residue  (or  residues)  of  a  substrate. 
The  Lys  residue  is  the  site  of  attachment  of  a  multi-Ub  chain 
[16-18].  Arg-e^-DHFR  contains  a  complete  N-degron  because 
it  bears  both  an  N-terminal  destabilizing  residue  (Arg)  and,  in 
the  e*^  extension,  at  least  one  targetable  Lys  residue.  In  contrast, 
the  N-degron  of  Arg-DHFR  is  incomplete,  for  at  least  one  of  the 
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two  mutually  non-exclusive  reasons:  the  absence  of  Lys  residues 
accessible  to  the  targeting  complex  of  the  N-end  rule  pathway 
and/or  poor  accessibility  of  the  N-terminal  Arg  in  Arg-DHFR  to 
the  same  targeting  complex.  Since  the  20-kDa  mouse  DHFR 
contains  16  Lys  residues,  this  interpretation  presumes  that,  in  a 
folded  DHFR  molecule,  the  lysines  arc  ineffective  as  ubiquity- 
lation  sites,  because  of  the  lysine's  lack  of  mobility  and/or  its 
distance  from  a  destabilizing  N-terminal  residue  [lOJ. 

Pre\'ious  work  [19]  described  a  temperature-sensitive  (n) 
allele  of  DHre  that  was  used  to  construct  a  heat-inducible 
N-desron.  in  a  Ub- Arg-DHFR^  -Ura3p  fusion  which  contained 

cercvisioc  Ura3p  as  a  C-terminal  reporter.  The  Ub- Arg- 
DHFR''  moiety  of  this  fu.sion  was  identical  with  Ub-Arg-DHFR 
above,  except  for  a  Pro  — ►  Leu  alteration  at  position  66  of  the 
DHFR  moiety.  Arg-DHFR’'-Ura3p  was  long-lived  at  23  °C  but 
short-lived  at  37  °C  in  yeast.  Moreover,  the  Ub-Arg-DHFR'' 
module  of  Ub-Arg-DHFR''-Ura3p  was  shown  to  be  portable,  in 
that  linking  it  to  a  protein  of  interest  conferred  heat-inducible 
metabolic  instability  on  the  entire  fusion  [19]. 

In  addition  to  yielding  a  new  method  for  the  construction  of  ts 
mutants,  the  heat-inducible  N-degron  [19]  provided  an  approach 
to  analyzing  the  mechanism  of  targeting  by  the  N-end  rule 
pathway.  In  carrying  out  this  analysis,  described  below,  we  have 
found  that  the  degradation  of  Arg-DHFR''  can  be  inhibited  in 
vivo  by  methotrexate  (MTX).  a  low-M,  ligand  of  DHFR.  We 
also  show  that,  in  mouse  cells.  Arg-DHFR"  is  targeted  by  both 
the  N-end  rule  pathway  and  another  proteolytic  system,  whereas 
in  yeast  the  .same  test  protein  is  exclusively  an  N-end  rule 
.substrate.  The  N-degron  of  Arg-DHFR"  is  the  first  example  of  a 


portable  degradation  signal  whose  activity  can  be  modulated  in 
vivo  by  a  cell-penetrating  compound. 

MATERIALS  AND  METHODS 

Plasmids  for  expression  in  S.  cerevisiae 

The  plasmid  pJJRtd.  which  expressed  Ub-Arg-DHFR''-ha 
(Fig.  L  construct  I),  was  based  on  the  vector  pRS316  [20].  A 
I.O-kb  £coRI/f//«dIII  fragment  of  pPW58  [8]  was  ligated  into 
pRS316,  follow'ed  by  the  cloning  of  the  Prrpi  promoter- 
containing  0.4  kb  EcoRI/Sflfl  fragment  of  pPW48  [19]  into  the 
EroRI  site,  a  .step  that  yielded  pJJRtd.  pJJMtd.  which  expressed 
Ub-Met-DHFR‘'-ha,  was  prepared  by  ligating  the  large  Ageli 
AflW  fragment  of  pJJRtd  to  a  small  fragment  from  the  AgeMAjlW 
digest  of  pLGUbM-DHFRha  [21]. 

Plasmids  for  expression  in  mammalian  cells 

Mouse  DHFR  and  DHFR*'  open  reading  frames  bearing  the 
desired  restriction  sites  at  their  5'  and  3'  ends  were  produced 
using  PCR.  All  DHFR  and  DHFR''  moieties  were  flanked,  at 
their  C-termini,  with  the  ha  epitope  [22]  (Fig.  1 ).  and  were 
cloned  into  the  pRc/CMV  vector  (Invitrogen.  San  Diego.  CA, 
USA). 

Constructs  II  and  Ill  (Fig.  1)  were  produced  by  PCR 
amplification  of  a  fragment  encoding  Met-DHFR''  and  Arg- 
DHFR'',  respectively,  using  construct  I  as  a  template.  The  N- 
terminal  Met  of  construct  11  was  specified  by  the  PCR  primer 


Fig.  1.  Test  proteins.  Fusions  used  in  this  work 
contained  some  of  the  following  moieties:  the  Ub 
moiety,  either  N-terminal  (constructs  1,  TV-VII) 
or  placed  between  other  moieties  (constructs  II 
and  III):  an  Arg  residue  at  the  junction  between 
Ub  and  the  C-terminal  part  of  the  fusion 
(constructs  I.  III.  V  and  VII);  a  Met  residue  at  the 
same  junction  (constructs  11.  IV.  and  VI):  a  42- 
residue  £.  coli  Lac  repressor-derived  sequence, 
termed  e^^  [extension  (e)  lacking  lysines  (AK)1. 
between  Ub  and  the  reporter  part  of  the  fusion  (see 
the  main  text  and  [22.27])  (constructs  IV- VII): 
DHFRha,  a  mouse  DHre  moiety  extended  at  the 
C-ierminus  by  a  sequence  containing  the  hemag¬ 
glutinin-derived  ha  epitope  (constructs  I-VIl): 
DHFR'\  a  mutant  temperature-sensitive  DHFR 
moiety  that  differs  from  DHFR  by  the  Pro  — »  Leu 
alteration  at  position  66  [19]  (see  the  main  text). 
Amino  acid  residues  are  indicated  by  their  single¬ 
letter  abbreviations.  Residues  that  vary  between 
constructs  are  boxed  in  the  sequences  above  the 
diagrams. 


Ub  DHFRts  ha 

MQ . GG0HGSGIMV...L . KKGTYP...FL 

1  76  1 

H  PHFR*S  |ha|  Ub-Arg-DHFR»s 

DHFR  ha  DHFR*®  ha 

MV . KKGT . FL  MQ..R.GG0HGSG1MV...L . KKGTYP...FL 


186  1  76  1 

\  /  //. 


186 


1 1  M-  DHFR 

ha  I 

11. 

X 

Q 

ha 

III  M-  DHFR 

ha  “ 

II. 

X 

Q 

ha 

Ub 

eAK 

DHFR*® 

1 

dha-Ub-Met-DHFR<® 

dha-Ub-Arg-DHFR*» 


IV 

V 

VI 

VII 


MQ....GG0HG..R.R...RSG1MV...L . KKGTYP...FL 

1  76  1  15  17  186  ^ 

Ui 


DHFR*® 

ha 

DHFR*® 

ha 

DHFR 

ha 

DHFR 

ha 

Ub-Met-e**^DHFR‘s 

Ub-Arg-e^K-DHFR** 

Ub-Met-e^DHFR 

Ub-Arg-e^DHFR 


246  F.  Levy  et  al.  {Eur.  J.  Biochem.  259) 


€  FEBS  1999 


annealing  to  the  5'  end  of  the  fragment.  The  PCR  products  were 
digested  with  Xbal,  blunted  with  Klenow  Pol  I,  followed  by  a 
cut  with  SflrII.  A  fragment  bearing  an  open  reading  frame  was 
inserted  between  the  blunt-ended  Not]  site  and  the  .SacII  site  of 
the  plasmid  pRc/dhaUbMbgal  [23].  resulting  in  the  plasmids 
pRc/dha>Ub-Met>DHFR^"  and  pRc/dha-Ub-Arg-DHFR’".  Met- 
and  Arg-DHFR*^  contained  the  six-residue  sequence  His-Gly- 
Ser-Gly-IIe-Met  between  the  N-terminal  residue  and  Val.  the 
first  natural  residue  of  DHFR  (Fig.  1). 

To  produce  constiucts  IV- VII  (Fig.  1),  a  fragment  encoding 
Arg-DHFR'"  w^as  amplified  by  PCR.  using  construct  1  as 
template.  The  PCR  product  was  digested  with  Xbah  and  blunted 
w'ith  Klenow^  Pol  1.  followed  by  a  5fldl  cut.  The  resulting 
fragment  w^as  inseited  between  the  blunt-ended  Notl  site  and  the 
5^cll  site  of  pRc/UbSPgal.  yielding  pRc/Ub-Arg-DHFR'\  The 
latter  ser\'ed  as  a  vector  for  the  constructs  IV- VII.  Constructs 
rV  and  V  were  obtained  by  amplifying  fragments  encoding, 
respectively.  Met-e-^'^-DHFR''  and  Arg-e’^'^-DHFR''.  usins  the 
plasmid  pLG/Ub'""’-Val-e^'^DHFRha  [22]  as  a  templaterihe 
N-terminal  residues  Met  and  Arg  were  specified  by  a  PCR 
primer  annealing  to  the  5'  end  of  the  fragment.  Digestion  of  the 
PCR  products  with  5^vtil  and  P/7MI  yielded  a  fragment  coding 


for  Met-  or  Arg-e-^^^  and  the  first  39  residues  of  DHFR 
(DHFRi.^y).  (P/7M1  cleaves  the  DHFR'''  open  reading  frame 
upstream  of  the  mutated  codon  at  position  66.)  This  fragment 
was  inserted  in-frame  betw^een  the  SacWIPflMl  sites  of  the 
plasmid  pRc/Ub-Arg-DHFR‘\  vielding  plasmids  expressing  Ub- 
Met-c-^^-DHFR''  and  Ub-Arg-e-^^-DHFR''.  Constructs  VI  and 
VII  were  produced  by  exchanging  a  SnaBlIPflml  fragment  of 
pRc/dhaLlbNe*^DHFRha  with  the  SnaBlfPflM]  fragments 
derived  from  constructs  IV  and  V.  respectively.  This  step 
eliminated  the  fragment  encoding  dha-Ub-Asn-e‘'^-bHFR,_i^,  of 
the  original  plasmid,  replacino  it,  in-frame,  with  the  sequence 
encoding  either  Ub-Met-e^’'^-DHFR,_,3  or  Ub-Arg-e"^^- 
DHFRi_iv  (The  plasmid  pRc/dhaUbNe'^DHFRha  was  con¬ 
structed  by  replacing  the  region  encoding  Pgal  in  pRc/ 
dhaUbXPgal  [23]  with  the  region  encoding  DHFR.  ) 

Yeast  and  mouse  cell  cultures,  transfection,  and  pulse-chase 
analysis 

5.  cerevisiae  were  grown  and  manipulated  as  described 
previously  [24]  and  in  the  legend  to  Fig.  2.  Mouse  L  cells,  a 
fibroblast-like  cell  line  (ATCC  CCL  1.3.  American  Type 
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Fig.  2.  Inhibition  of  Arg-DHFR‘"  degradation  by  MIX  in  S.  cerevisiae.  (A)  Lane  M.  molecular  mass  markers.  Lanes  a-c,  JD47-13c  (UBRI)  cells  expressing 
Arg-DHFR  (Ub-Arg-DHFR'  )  were  labeled  for  4  min  at  23  °C  with  f  ^''Slmethionine/cysteine,  followed  by  a  chase  for  10  and  30  min  as  described  [40]  then 
extraction,  immunoprecipitation,  and  Tricine-based  SDS/PAGE  (129.  gel)  of  Arg-DHFR'^  [41].  Lanes  d-f,  same  as  lanes  a-c,  except  that  immediately  after  the 
4-min  pulse  cells  were  shifted  to  37  “C.  and  the  cha.se  was  carried  out  at  37  °C.  Lanes  g-i,  same  as  lanes  d-f,  but  an  independent  pulse-chase  experiment.  Lanes 
j-1,  same  as  lanes  g-i.  but  with  JD55  (ubrIA)  5.  cerevisiae.  Lanes  m-o.  same  as  lanes  g-i,  but  with  KMY613  {uhc2d)  S.  cerevisiae  strain  expressing  the 
plasmid-bome  UBC2  gene  [42].  Lanes  p-r,  same  as  lanes  m-o.  but  with  KMY6I3  (ubc2A)  lacking  the  plasmid-borae  UBC2.  Lanes  s-u,  JD47-nc  (UBRI)  cells 
expressing  Arg-DHFR"-ha-Cdc28p  (Ub-Arg-DHFR''-ha.Cdc28p)  [19]  were  labeled  and  processed  as  described  for  lanes  d-f.  Unes  v-x,  same  as  lanes  s-u,  but 
with  MTX  added  to  the  growth  medium  to  a  final  concentration  of  20  jiM  30  min  before  the  pulse.  (B)  Lane  M,  molecular  mass  markers.  Lanes  a-c  JD47-I3c 
(UBRI)  cells  expressing  Met-DHFR*^  (Ub-Met-DHFR'^)  were  labeled  for  4  min  at  23  T  with  [‘^-^Slmethionine/cysteine.  Immediately  thereafter,  the’cells  were 
shifted  to  37  °C,  followed  by  a  chase  at  37  X  for  10  and  30  min.  extraction,  immunoprecipitation.  and  SDS/PAGE  of  Met-DHFR‘".  Lanes  d-f.  same  as  lanes  a- 
c,  but  with  Arg-DHFR  .  Lanes  g-i,  same  as  lanes  a-c,  but  with  MTX  added  to  the  growth  medium  30  min  before  the  pulse  to  a  final  concentration  of  20  \iM. 
Lanes  j-1,  same  as  lanes  g-i.  but  with  Arg-DHFR‘\  Lanes  m-o,  same  as  lanes  a-c,  except  that  both  the  pulse  and  the  chase  were  at  37  X.  Lanes  p-r,  same  as 
lanes  m-o,  but  with  Arg.DHFR'\  Lanes  s-u,  same  as  lanes  m-o,  but  with  MTX  added  to  the  growth  medium  30  min  before  the  pulse  to  a  final  concentration  of 
20  p.M.  Lanes  v-x,  same  as  lanes  s-u.  but  with  Arg-DHFR'^ 
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Culture  Collection,  Rockville.  MD.  USA),  were  grown  as 
monolayers  in  Dulbecco's  modified  Eagle's  medium/F12 
medium,  supplemented  with  10%  fetal  bovine  serum,  antibiotics, 
2  mM  L-glutamine  and  20  mM  Hepes  (sodium  salt:  pH  7.3).  The 
cultures  were  regularly  checked  for  the  absence  of  mycoplasmas. 
Transient  transfections  and  pulse-chase  analyses  were  per¬ 
formed  as  described  [23J.  For  the  pulse-chases  at  23  °C.  cells 
were  transferred  to  this  temperature  30  min  before  labeling  tor 
10  min  with  0.1  mCi  (1  Ci  =  37  GBq)  of  [^^SJEXPRESS  (New 
England  Nuclear.  Boston,  MA.  USA).  Cells  were  chased  at 
either  23  T  or  37  °C  in  the  complete  medium  containing 
0.1  mg  mL"^  cycloheximide.  All  media  were  pre-equilibrated  at 
the  desired  temperature.  In  experiments  that  involved  MTX.  the 
drug  was  added  30  min  before  labeling  and  was  present  at  a 
final  concentration  of  20  fxM  (diluted  from  20  niM  MTX  in 
0.9  M  potassium/sodium  phosphate,  pH  7.3).  The  samples  were 
analyzed  by  SDS/PAGE  (15%  gel),  followed  by  autoradio¬ 
graphy  and  quantitation  using  a  Phosphorlmager  (Molecular 
Dynamics,  Sunnyvale,  CA,  USA). 

To  distinguish  between  partial  half-lives  that  approximate  the 
slopes  of  different  regions  of  a  non-exponential  decay  curv'e,  a 
generalized  half-life  term,  fo  ^’,  was  used,  where  0.5  denotes  the 
parameter's  half-life  aspect  and  y-c  denotes  the  relevant  time 
interval,  from  y  to  c  min  of  chase.  Another  term  [23],  initial 
decay  (ID),  expressed  as  a  percentage,  equals  1  minus  the  ratio 
of  the  amount  of  a  radiolabeled  X-DHFR  test  protein  (normal¬ 
ized  against  the  reference  protein.  Met-DHFR-Ub;  Fig.  1 )  at  the 
end  of  the  pulse  (time  0)  to  the  (normalized)  amount  of  a 
radiolabeled  long-lived  X-DHFR  such  as  Met-DHFR.  As  ID 
may  depend  on  the  duration  of  a  pulse,  a  superscript,  in  ID', 
invokes  the  pulse  time  explicitly  [23], 

RESULTS 

Inhibition  of  Arg-DHFR*''  degradation  by  MTX  in 
S.  cerevisiae 

The  major  test  protein  of  this  work  was  Arg-DHFR^^  (produced 
from  Ub-Arg-DHFR’'')  (Fig.  1,  construct  I),  identical  with  the 
Arg-DHFR’"  moiety  of  a  larger  fusion  described  in  the  intro¬ 
duction.  The  test  proteins  bore  C-terminal  ha.  a  hemagglutinin- 
derived  epitope  tag  recognized  by  a  monoclonal  antibody  [22] 
(Fig.  1 ;  the  ha  epitope  is  omitted  below  in  the  names  of  specific 
test  proteins).  For  the  analysis  in  S.  cerevisiae,  Arg-DHFR‘'’  was 
expressed  from  the  copper-inducible  Pcupi  promoter  in  a  low- 
copy  vector,  and  the  metabolic  fate  of  Arg-DHFR^'’  was  deter¬ 
mined  in  a  pulse-chase  assay  (see  Materials  and  methods). 
Similarly  to  the  previously  examined  Arg-DHFR‘'‘-Ura3p  and 
Arg-DHTR^'-CdciSp  [19],  Arg-DHFR^"  was  a  long-lived  protein 
at  23  °C  (r()  5  >  4  h)  but  a  short-lived  protein  at  37  °C  (/0.5  === 
4  min)  (Fig.  2 A.  lanes  a-c  vs.  d-f).  The  degradation  of  Arg- 
DHFR'^  at  37  °C  was  carried  out  by  the  N-end  rule  pathway,  as 
indicated  by  (a)  the  requirement  for  Ubrlp  (E3),  the  recognition 
component  of  this  pathway  (Fig.  2A,  lanes  g-i  vs.  j-1),  (b)  the 
requirement  for  Ubc2p  (E2),  the  relevant  Ub-conjugating  enzyme 
(Fig.  2A,  lanes  m-o  vs.  p-r),  and  (c)  the  long  half-life  of  Met- 
DHFR‘^  an  otherwise  identical  protein  that  bore  N-terminal 
Met.  a  stabilizing  residue,  instead  of  Arg  (Fig.  2B,  lanes  a-c  vs. 
d-f). 

Previous  work  [27]  has  shown  that  the  degradation  of  Arg-e*^- 
DHFR  by  the  N-end  rule  pathway  in  an  extract  from  rabbit 
reticulocytes  can  be  inhibited  by  the  folate  analog  MTX,  a  high- 
affinity  DHFR  ligand  (Xd  «  10  pM).  It  was  far  from  clear 
whether  MTX  would  have  the  same  effect  in  vivo,  in  part 
because  the  purified  N-end  rule  substrates  added  to  the  extract 
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contained  the  folded  DHFR  moieties  [27].  By  contrast,  a  nascent 
DHFR-based  substrate  is  unable  to  bind  MTX  until  after  the 
folding  of  DHFR  [28],  but  can  be  targeted  by  the  N-end  rule 
pathway  in  vivo  at  any  time,  possibly  even  during  translation 
[16]. 

The  metabolic  fate  of  Arg-DHFR'"  at  37  T  was  monitored  in 
5.  cerevisiae  in  the  absence  or  presence  of  20  p.M  MTX  in  the 
medium.  MTX  w^as  found  to  be  an  efficacious  inhibitor  of  Arg- 
DHFR'"  degradation  in  5.  cerevisiae  (Fig.  2B.  lanes  d-f  and  p-r 
vs.  J-1  and  v-x).  MTX  inhibited  the  degradation  of  Arg-DHFR’'' 
regardless  of  w^hether  it  was  added  to  the  growth  medium 
15  min  or  24  h  before  the  puLse-chase  a.ssay  (data  not  shown: 
the  grow'th  of  cells  was  not  affected  significantly  by  20  |XM 
MTX  for  at  least  24  h).  The  protective  effect  of  MTX  on  Arg- 
DHFR*'’  was  essentially  the  same  regardless  of  W'hether  the  pulse 
labeling  (followed  by  the  37  °C  chase)  w'as  carried  out  at  23 
(Fig.  2B,  lanes  d-f  vs.  J-1)  or  at  37  °C  (Fig.  2B,  lanes  p-r  vs. 
v-x). 

The  degradation  (at  .37  °C)  of  Arg-DHFR''‘-Cdc28p.  which 
differed  from  Arg-DHFR*"'  (produced  from  Ub-Arg-DHFR''')  by 
the  presence  of  the  Cdc28p  kinase  moiety,  w^as  also  inhibited  by 
MTX  (Fig.  2A,  lanes  s-u  vs.  v-x).  Arg-DHFR’''-Cdc28p,  which 
was  long-lived  at  23  °C.  or  at  37  °C  in  ubrl A  cells,  could 
function  as  a  Cdc28p  kinase  in  vivo  [19j.  As  Arg-DHFR^''- 
Cdc28p  was  short-lived  at  37  °C  in  UBRI  S.  cerevisiae 
(Fig.  2A,  lanes  s-u),  the  cells  that  also  carried  a  mutant  (/5) 
cdc28  allele  [19]  were  not  viable  at  37  °C,  owing  to  the  absence 
of  the  essential  Cdc28p  kinase  at  this  temperature.  The  addition 
of  MTX  (at  20  |XM)  w^as  found  to  rescue  these  cells  at  37  °C 
through  the  inhibition  of  degradation  of  Arg-DHFR^''-Cdc28p 
(Fig.  2 A,  lanes  s-u  vs.  v-x,  and  data  not  shown).  The  effect  of 
MTX  was  specific  for  DHFR-containing  substrates,  as  degrada¬ 
tion  of  unrelated  test  proteins,  for  example  Arg-Pgal  (Ub-Arg- 
pgal),  by  the  N-end  rule  pathway  was  unimpaired  in  the 
presence  of  MTX  (data  not  shown). 

DHFR  as  an  N-end  rule  substrate  in  mammalian  cells 

DHFR-based  substrates  of  the  N-end  rule  pathway  were  used  in 
the  earlier  studies  with  5.  cerevisiae  [16,19],  but  have  not  been 
examined  in  mammalian  cells.  We  used  L  cells,  a  mouse 
fibroblast- 1  ike  cell  line,  and  the  ubiquitin/protein/reference 
(UPR)  technique,  which  increases  the  accuracy  of  pulse-chase 
assays  [26]  by  providing  a  reference  protein  [23].  In  the  UPR 
technique,  Ub  is  located  between  a  protein  of  interest  and  a 
reference  protein  in  a  linear  fusion  (e.g.  construct  II  in  Fig.  1). 
This  fusion  is  cleaved,  cotranslationally  or  nearly  so,  by  UBPs  at 
the  last  residue  of  Ub,  producing  equimolar  amounts  of  the 
protein  of  interest  and  the  reference  protein  bearing  a  C-terminal 
Ub  moiety.  If  both  the  reference  protein  and  the  protein  of 
interest  are  immunoprecipitated  in  a  pulse-chase  assay,  the 
relative  amounts  of  the  protein  of  interest  can  be  normalized 
against  the  reference  in  the  same  sample.  The  UPR  technique 
can  thus  compensate  for  the  scatter  of  immunoprecipitation 
yields,  sample  volumes  and  other  sources  of  sample-to-sample 
variation  [23]. 

The  UPR  constructs  of  the  present  work  were  fusions 
containing  the  metabolically  stable  Met-DHFR-Ub  moiety  as  a 
reference  protein  (termed  dha-Ub)  and  a  DHFR-based  N-end 
rule  substrate  such  as,  for  example,  Arg-DHFR*"  as  a  protein  of 
interest  (Fig.  1,  construct  III),  To  preclude  the  possibility  that 
the  C-terminal  Ub  moiety  of  dha-Ub  could  function  as  a 
ubiquitylation/degradation  signal,  the  Lys48  residue  of  Ub  (a 
major  site  of  isopeptide  bonds  in  multi-Ub  chains  [17,22,29]) 
was  converted  into  Arg,  which  cannot  be  ubiquitylated  [12], 
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Fig.  3.  inhibition  of  Arg«DHFR‘^  degradation 
in  mouse  cells  by  MTX,  (A)  Lanes  a-d.  mouse  L 
cells  transiently  expressing  a  UPR-bascd  \ersion 
of  Mei-DHFR'^  (Mei-DHFR.ha-Ub^''^Mer- 
DHPR'^ha:  see  Fig.  L  construct  II  )  were  labeled 
at  23  for  10  min  with  f '^S]methionine/ 
cysteine,  followed  by  a  chase,  also  at  23  X. 
for  30,  60  and  120  min  in  the  presence  of 
cycloheximide,  extraction,  immunoprecipitation. 
and  SDS/PAGE  of  Met-DHFR''-ha  and 
Met-DHFR-ha-Ub'^'*^.  Lancs  e-h,  same  as  lanes 
a~d  but  with  Arg-DHFR'"  (Mei-DHFR-ha- 
Ub'^'^^-Arg-DHFR''-ha}.  (B)  Lanes  a-d.  same  as 
lanes  a-d  in  (A),  except  that  both  the  pulse  and 
the  chase  were  at  37  X.  Lanes  e-h.  same  as  lanes 
a-d.  but  with  Arg-DHFR‘'-ha  (Met-DHFR-ha- 
UI^R4N_Arg_DHFR'"-ha).  (C)  Lanes  a-d.  same  as 
lanes  a-d  in  (B)  but  with  20  jjlm  MTX  in  the 
medium.  Lanes  e-h,  same  as  lanes  a-d.  but  with 
Arg-DHFR'^  (Met-DHFR-ha-Ub‘^-‘"-Arg-DHFR‘^ 
ha).  The  bands  of  Arg/Met“DHFR’''-ha  and 
Met-DHPR-ha-Ub*^"*^  (denoted  as  dha-Ub)  are 
indicated.  A  larger  area  of  the  gel  i.s  shown  for 
lanes  a-h  in  (B).  to  illustrate  the  overall 
immunoprecipitation  pattern. 


These  fusions  were  expressed  from  the  cytomegalovirus  early 
promoter.  Pca/\  .  in  mouse  L  cells.  The  cells  were  labeled  for 
10  min  at  either  23  °C  or  37  °C  with  f  '''’S]methionine/cysteine, 
followed  by  a  chase  of  30,  60  and  120  min  at  the  same  tem¬ 
perature  in  the  presence  of  cycloheximide,  immunoprecipitation 
with  anti-ha  monoclonal  antibody,  and  analysis/quantitation  of 
immunoprecipitated  proteins  by  SDS/PAGE  and  Phosphor- 
Imager,  using  UPR  (see  Materials  and  methods).  Pulse -chase 
assays  in  which  cycloheximide  (a  translation  inhibitor)  was 
omitted  yielded  similar  results  (data  not  shown). 

Both  Met-DHFR*"'  and  Arg-DHro’''  were  long-lived  in  mouse 
cells  at  23  X  (/o  s  >  10  h)  (Fig.  3A,  lanes  a-h,  and  Fig.  4A). 
By  contrast.  Arg-DHFR*"  was  short-lived  at  37  X  in  mouse 
cells:  its  half-life,  determined  between  0  and  60  min  of  chase 
and  denoted  as  *  [23],  was  ===  10  min  (Fig.  3B,  lanes  e-h, 
and  Fig.  4A).  In  addition,  a  large  fraction  (~  30%)  of  the  pulse- 
labeled  Arg-DHFR*"  (but  not  of  Mel-DHFR^'')  was  degraded 
during  the  10-min  pulse  at  37  X,  as  could  be  seen  from 
comparing  the  UPR-normalized  amounts  of  Arg-DHFR*^  at  lime 
0  (the  beginning  of  the  chase)  at  23  X  and  37  X  (Fig.  4A).  In 
contrast  with  a  conventional  pulse-chase  assay,  the  use  of  UPR 
in  a  setting  where  a  protein  can  be  made  either  short-lived  or 
long-lived  allows  the  detection  and  measurement  of  proteolysis 
not  only  during  the  chase  but  during  the  pulse  as  well  [23].  The 
extent  of  degradation  of  a  protein  during  the  pulse  is  denoted  as 
ID'  [23].  In  the  present  context,  the  variable  ID'  is  defined  as  the 
extent  of  degradation  of  a  radiolabeled  short-lived  protein  (Arg- 
DHFR'^  at  37  X)  at  the  end  of  a  pulse  relative  to  the  amount  of 
a  nearly  identical  protein  (Met-DHFR'"  at  37  X)  that  is  long- 
lived  under  the  same  conditions. 

A  large  fraction,  but  not  all,  of  the  Arg-DHFR‘'  degradation  in 
mouse  cells  was  carried  out  by  the  N-end  rule  pathway.  This 
could  be  seen  by  comparing  the  UPR-based  decay  curves  of 
Met-DHFR'"  and  Arg-DHFR*"  at  37  X  (Fig.  3B  lanes  a-d  vs. 
lanes  e-h  and  Fig.  4A).  Specifically,  not  only  Arg-DHFR*"  but 
also  Met-DHFR*"  were  metabolically  unstable  at  37  X  in  mouse 
cells,  the  corresponding  being  10  min  and  ^  48  min, 
respectively  (Fig.  4A).  As  Met  is  a  stabilizing  residue  in  the 


N-end  rule  [23.30],  we  infer  that  Arg-DHFR*"  was  targeted  not 
only  by  the  N-end  rule  pathway,  but  also  by  another  proteolytic 
system,  which  recognized  a  structural  feature  that  resulted  from 
a  conformational  perturbation  of  the  DHFR'"  moiety  at  37  X. 
This  aspect  of  Arg-DHFR*"  acted  as  a  degron  in  mouse  cells  but 
not  in  yeast,  inasmuch  as  the  same  protein  was  targeted 
exclusively  by  the  N-end  rule  pathway  in  5.  cerevisiae  at  37  X 
(Fig.  2A  lanes  g-i  vs.  lanes  j-1).  The  nature  of  this  degradation 
signal  in  the  conformational ly  perturbed  (at  37  X)  DHFR*" 
moiety  is  not  known. 

Inhibition  of  both  degradation  signals  of  Arg-DHFR*''-ha  by 
MTX  in  mouse  cells 

Mouse  L  cells  expressing  Met-DHFR*"  (dha-Ub-Met-DHFR*")  or 
Arg-DHFR*"  (dha-Ub- Arg-DHFR*")  were  incubated  at  37  X  for 
30  min  with  20  p.M  MTX  in  the  medium,  followed  by  a  10-min 
pulse  with  [■^''S]methionine/cysteine,  a  chase,  and  SDS/PAGE 
analysis.  The  degradation  of  Met-DHFR*"  at  37  X,  which  is 
mediated  by  a  non-N-degron  in  the  conformationally  perturbed 
DHFR*"  moiety  (see  above),  was  virtually  completely  inhibited 
by  MTX  (Fig.  3B  lanes  a-d  vs.  Fig.  3C  lanes  a-d  and  Fig.  4B). 
Specifically,  the  of  Met-DHFR*"  at  37  X  was  ^  48  min  in 
the  absence  of  MTX  and  >  10  h  in  the  presence  of  MTX 
(Fig.  4B).  The  much  more  rapid  degradation  of  Arg-DHFR*" 
carried  out  by  both  the  N-end  rule  pathway  and  the  other 
proteolytic  pathway  was  strongly  but  incompletely  inhibited  by 
MTX  (Fig.  3B  lanes  e-h  vs.  Fig.  3C  lanes  e-h  and  Fig.  4B). 
Specifically,  in  the  absence  of  MTX,  the  of  Arg-DHFR*" 
was  ^  10  min  (ID*^^  of  36%)  (Fig.  4B).  In  the  presence  of 
MTX,  these  variables  changed  to  ^  37  min  and  ID”*^  of 
29%  (Fig.  4B). 

A  comparison  of  the  corresponding  decay  curves  (Fig.  4), 
made  more  reliable  by  the  increased  accuracy  of  a  UPR-based 
pulse-chase  assay,  suggested  one  reason  for  the  leaky  inhibition 
of  the  N-degron  of  Arg-DHFR*"  by  MTX.  Specifically,  the  ID'" 
of  Met-DHFR*"  (i.e.  the  extent  of  Met-DHFR*"  degradation 
during  the  pulse)  was  <  10%  in  the  absence  of  MTX,  whereas 
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Fig.  4.  Decay  curves  of  Met-DHFR*"  and  Arg-DHFR‘‘'  at  23  '‘C  and 
37  °C  in  mouse  cells,  in  the  absence  (A)  and  presence  of  MTX  (B).  These 
curves  were  derived  from  the  UPR-based  electrophoretic  data  (Figs  3  and  5 
and  analogous  evidence),  as  described  in  Materials  and  methods. 


the  ID'“  of  Arg-DHFR*"  under  the  same  conditions  was  ==  369^. 
In  other  words,  more  than  one-third  of  labeled  Arg-DHFR*" 
molecules  were  degraded  during  the  10-min  pulse  (Fig.  4B). 
Moreover,  the  ID**'  of  Arg-DHFR*''  remained  high  (=^  309^) 
even  in  the  presence  of  MTX  (Fig.  4B).  This  result  suggested 
that  a  large  fraction  of  the  newly  formed  Arg-DHFR'^'  molecules 
was  targeted  for  degradation  by  the  N-end  rule  pathway  before 
the  conformation  of  Arg-DHFR'^  w^as  mature  enough  to  allow 
high-affinity  binding  of  MTX  (see  the  Discussion). 


An  exposed  N-terminus  renders  Arg-DHFR*"  constitutively 
short-lived 

To  address  the  relative  contributions  of  the  first  (N-termina! 
Arg)  and  the  second  (internal  Lys)  determinant  of  the  N-degVon 
in  Arg-DHFR’\  we  extended  its  N-terminus  with  a  42*residue, 
Escherichia  coli  Lac  repressor-derived  sequence,  termed  e"^*^ 
[extension  (e)  lacking  lysines  (AK)]  [16].  Met-e^*^-DHFR‘^  (Ub- 
Met-e'^’^-DHFR*")  (Fig..i,  construct  IV),  which  bore  a  stabiliz¬ 
ing  N-terminal  residue,  was  long-lived  at  23  °C  in  mouse  cells 
>  10  h)  (Fig.  5A.  lanes  a-c).  Arg-e‘^‘^-DHFR’^  (Ub-Arg- 
e’^^-DHFR*'')  (Fig.  1,  construct  V),  w'hich  bore  a  destabilizing 
N-terminal  residue,  was  short-lived  at  23  °C  45  min; 

Fig.  5A,  lanes  d~f),  in  contrast  with  the  otherwise  identical  Arg- 
DHFR'"  that  lacked  the  extension  and  was  long-lived  at 
23  °C  (Fig.  3A,  lanes  e-h).  Note  that  Arg-e’^’^-DHFR'"  w^as 
short-lived  at  23  °C,  in  spite  of  the  fact  that  the  e"^*^  extension 
did  not  contribute  additional  Lys  residues  to  Arg-DHFR'\  w^hich 
w'as  long-lived  at  23  °C. 

At  37  °C,  Met-e'^’^-DHFR*''  was  approximately  as  short-lived 
as  Met-DHFR*'';  the  degradation  of  these  proteins  was  carried 
out  by  a  non-N-end  rule  pathway  that  targeted  a  degron  (d^)  in 
the  conformational ly  perturbed  DHFR*"'  moiety  (Fig.  5B,  lanes 
a-c:  compare  with  Fig.  3B,  lanes  a-d).  In  contrast,  the 
degradation  of  Arg-e'^^^-DHFR*"  at  37  °C  was  much  faster  than 
that  of  Met-e'^*^-DHFR*^  the  difference  in  rate  being  due  to  the 
targeting  of  Arg-e^*^-DHFR*''  by  both  the  N-end  rule  pathway 
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Fig.  5.  Effects  of  temperature  and  MTX  on  the  degradation  of  N-terminally  extended  DHFR‘'‘  and  DHFR  in  mouse  cells.  (A)  Lanes  a-c.  mouse  L-cells 
transiently  expressing  Met-e^^-DHPR'"*  (Fig.  I,  construct  IV)  were  labeled  at  23  for  10  min  with  [^^S]methionine/cysteine,  followed  by  a  chase,  also  at  23  °C. 
for  60  and  120  min  in  the  presence  of  cycloheximide.  extraction,  immunoprecipiiation,  and  SDS/PAGE  analysis.  Lanes  d-f,  same  as  lanes  a-c  but  with  Arg-e'^'^- 
DHFR‘\  (B)  Lanes  a-f.  same  as  lanes  a-f  in  (A),  but  the  pulse -chase  was  carried  out  at  37  T.  (C)  Lanes  a-c,  same  as  lanes  a-c  in  (B),  but  the  pulse-chase  of 
Met-e'^^^-DHFR*''  was  carried  out  in  the  presence  of  20  pM  MTX  in  the  growth  medium.  Lanes  d-f.  same  as  lanes  a-c,  but  with  Arg-e‘^*^-DHFR‘''.  (D)  Lanes  a-c, 
same  as  lane.s  a-c  in  (B),  but  with  Met-C^^-DHFR  (the  wild-type  DHFR  moiety).  Lanes  d-f,  same  as  lanes  a-c,  but  with  Arg-c^^-DHFR.  The  asterisk  indicates 
a  cleavage  product  derived  from  a  minority  of  short-lived  Arg-e-^^-DHFR'^  and  Arg-e'^^-DHFR  molecules  during  their  targeting  by  the  N-end  rule  pathway  (see 
the  main  text).  (E)  Lanes  a-c,  same  as  lane  a-c  in  (D),  but  in  the  presence  of  20  pM  MTX.  Lanes  d-f,  same  as  lanes  a-c  but  with  Arg-e'^’^-DHFR. 
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and  the  -recognizing  pathway  (Fig.  5B,  lanes  d-f).  A  minor 
proteolytic  fragment  of  Arg-e'^‘^-DHFR‘\  visible  in  Fig.  5B-E 
(lanes  d-f).  was  specific  to  N-cnd  rule  substrates  bearing  an 
e'^^-type  extension:  an  analogous  fragment  was  also  observed 
with  X-e’^-Pgal-hased  N-end  loile  substrates  [16]. 

We  also  examined  the  metabolic  fate  of  Met-e'^^-DHFR  (Ub- 
Met-e^'^-DHFR)  and  Arg-e^'^-DHFR  (Ub-Arg-e^^-DHFR). 
which  were  identical  with  the  substrates  above  except  that  they 
bore  the  wild-type  mouse  DHFR  moiety  (Fig.  1.  constiucts  VI 
and  VII).  Met-e^'^-DHFR  was  long-li\Td  at  both  23  T  and 
37  "C  in  mouse  cells  (Fig.  5D.  lanes  a-c.  and  data  not  shown), 
indicating  that  the  heat-activated  d^-type  degron  of  substrates 
containing  the  DHFR'"  moiety  (e.g.  Fig.  4A)  resulted  from  the 
Pro66  gr;  Leu66  mutation  that  yielded  this  moiety.  The 
degradation  of  Arg-e^’^-DHFR  was  mediated  exclusively  by 
the  N-end  rule  pathw^ay  (Fig,  5D.  lanes  d-f  vs.  lanes  a-c), 
unlike  the  double-pathway  degradation  of  Arg-e'^*^-DHFR‘"  at 
37  T  (Fig.  5C.  lanes  d-f).  In  contrast  w'ith  the  results  w'ith  Arg- 
DHFR^"  and  Met-DHFR‘",  the  presence  of  MTX  did  not  result  in 
a  significant  stabilization  of  Arg-e'^^-DHFR  (Fig.  5D  lanes  d-f 
vs.  Figure  5E  lanes  d-f). 

DISCUSSION 

W’e  report  the  following  results. 

(a)  Arg-DHFR‘"  is  long-lived  at  23  ""C  but  short-lived  at  37  °C 
in  both  5-  cerevisiac  and  mouse  cells. 

(b)  In  yeast,  Arg-DHFR'"  is  degraded  (at  37  °C)  exclusively 
by  the  N-end  rule  pathw-ay,  whereas  in  mouse  cells  the  same 
protein  at  the  same  temperature  is  degraded  by  another 
proteolytic  pathway  as  well.  The  corresponding  mouse-specific 
degron  is  active  at  37  °C  but  inactive  at  23  °C.  This  degradation 
signal  is  a  currently  unknown  feature  of  the  perturbed  con¬ 
formation  of  the  DHFR'"  moiety  at  37  °C,  which  differs  from  the 
wild-tvpe  DHFR  moiety  by  a  Pro  ^  Leu  alteration  at  position 
66. 

(c)  MTX,  a  low-A/r  ligand  of  DHFR  (ATj  10  pM),  inhibits 
the  degradation  of  Arg-DHFR‘"  (at  37  °C)  nearly  completely  in 
yeast  and  partially  in  mouse  cells. 

(d)  The  ability  of  MTX  to  inhibit  the  degradation  of  Arg- 
DHFR"'  in  vivo  is  retained  when  Arg-DHFR''^  is  utilized  as  a 
portable  degron  that  confers  metabolic  instability  on  linked 
unrelated  proteins  in  yeast. 

(e)  Using  the  UPR  technique  [23]  to  determine  the  metabolic 
fate  of  Arg-DHFR'"  in  mouse  cells,  we  found  that  ^  36%  of  the 
labeled  Arg-DHFR'"  molecules  are  degraded  during  the  10-min 
pulse,  and  that  the  presence  of  MTX  decreases  this  fraction  only 
slightly,  to  ~  30%.  This  finding  accounts  for  the  leaky  inhi¬ 
bition  of  Arg-DHFR'"  degradation  by  MTX  in  mouse  cells. 
Specifically,  this  finding  suggests  that  a  large  fraction  of  the 
nascent  Arg-DHFR'"  is  targeted,  in  mouse  cells,  by  the  N-end 
rule  pathway  before  the  conformation  of  the  DHFR'"  moiety  is 
mature  enough  [28]  to  allow  high-affinity  binding  of  MTX,  That 
the  MTX-mediated  inhibition  of  degradation  of  Arg-DHFR'"  in 
yeast  is  much  less  leaky  than  in  mouse  cells  is  but  one  difference 
in  the  detailed  properties  of  the  yeast  and  mammalian  N-end 
rule  pathways  [see  also  item  (f)]. 

(f )  When  the  N-terminus  of  Arg-DHFR'"  was  extended  with  a 
42-residue  lysine-lacking  extension,  termed  the  N-degron 
of  the  resulting  Arg-e'^^-DHFR'"  was  active  at  both  23  X  and 
37  °C,  unlike  the  mouse-specific  degron,  which  remained 
inactive  at  23  °C.  Moreover,  the  degradation  of  Arg-e*^*^- 
DHFR'",  in  contrast  with  that  of  Arg-DHFR'",  could  not  be 
inhibited  by  MTX,  suggesting  that  the  inactivity  of  the  N-degron 
in  Arg-DHFR'"  at  23  °C  results  at  least  in  part  from  inaccessibility 


C  FEBS  1999 

of  its  N-terminal  arginine  to  the  targeting  complex  of  the  N-cnd 
rule  pathway.  Arg-e‘^^-DHFR,  which  bore  the  wild-type  DHFR 
moiety,  w'as  also  short-lived  in  mouse  cells.  Pieviously,  the 
same  protein  w'as  found  to  be  long-lived  in  yeast,  unless  the  e"^*" 
extension  w'as  replaced  by  an  otherwise  identical  extension, 
termed  c*",  which  contained  Lys  residues  that  functioned  as  the 
second  determinant  of  the  N-degron  [16]. 

Potential  applications  of  a  heat-inducible.  MTX-suppressible 
portable  N-degron  include  its  use  to  produce  conditional 
mutants  in  homeothermic  animals  such  as  mouse.  One  difficulty 
in  the  cun’ent  approaches  to  this  problem  [31-33]  is  that  either  a 
deletion  or  transcriptional  repression  of  a  gene  of  interest  leaves 
the  previously  produced  gene  product  unperturbed.  If  this 
protein  is  long-lived  and  resides  in  a  non-dividing  cell,  there 
may  be  a  considerable  phenotypic  lag  betw  een  the  conditionally 
introduced  genetic  change  and  the  actual  inactivation  or  dis¬ 
appearance  of  a  corresponding  protein.  A  degron  that  can  be 
regulated  by  a  cell-penetrating  ligand  can  be  employed  to 
address  this  problem., Since  chronic  administration  of  MTX  is 
toxic  to  mammals,  it  wall  be  necessary  to  construct  an  analogous 
DHFR-based  N-degron  that  uses  E.  coli  DHFR  and  is  inhibited 
by  trimethoprim,  which  binds  tightly  to  the  £.  coli  but  not  the 
mouse  DHFR  [34].  If  a  portable  N-degron  of  this  kind  could  be 
constructed,  combining  it  with  a  conditional  repression  of  a  gene 
of  interest  should  yield  better  methods  for  producing  conditional 
mutants  in  homeothermic  animals, 

MTX,  through  its  binding  to  the  substrate  pocket  of  DHFR. 
stabilizes  DHFR  conformation  [35,36].  This  effect  was 
employed  in  several  studies  with  DHFR  as  a  reporter  protein. 
For  example,  the  binding  of  MTX  to  DHFR  can  preclude,  under 
certain  conditions,  the  translocation  of  a  DHFR-containing 
fusion  protein  across  biological  membranes  [37,38].  MTX  can 
partially  protect  DHFR  against  in  vitro  proteolysis  by  thermo- 
lysin  [38,39].  MTX  has  also  been  shown  to  block  the  degra¬ 
dation  of  a  targeted  multiubiquitylated  Arg-e^-DHFR  in 
reticulocyte  extract  [27].  In  the  latter  study,  the  N-end  rule 
pathway  in  reticulocyte  extract  was  presented  with  a  prefolded 
wild-type  DHFR  moiety.  By  contrast,  in  the  present  work  with 
intact  cells,  the  same  pathway  could  target  a  DHFR-based  N-end 
rule  substrate  immediately  after  (and  possibly  even  during)  its 
synthesis.  Further  analysis  is  required  to  clarify  the  apparent 
discrepancy  between  the  previously  observed  inhibition  of 
degradation  of  Arg-e’^-DHFR  by  MTX  in  vitro  (in  reticulocyte 
extract)  [27]  and  the  absence  of  a  comparable  effect  of  MTX  in 
vivo  on  this  class  of  substrates,  in  which  N-terminal  Arg  is 
constitutively  exposed.  One  difference  between  the  two  settings 
is  the  presentation  of  prefolded  DHFR  moieties  in  reticulocyte 
extract  compared  with  a  kinetic  competition  between  the  folding 
of  newly  formed  DHFR  moieties  and  their  targeting  for 
degradation  by  the  N-end  rule  pathway. 

The  use  of  the  UPR  technique  not  only  increased  the  overall 
accuracy  of  pulse-chase  assays  with  mouse  cells  but  also 
showed  that  «  30%  of  the  newly  made  Arg-DHFR'"  was 
degraded  during  the  pulse.  The  finding  that  the  fraction  of  Arg- 
DHFR'"  degraded  during  the  pulse  decreased  only  slightly  in  the 
presence  of  MTX  strongly  suggests  that  most  of  the  labeled  Arg- 
DHFR'"  is  confoimationally  immature  shortly  after  synthesis, 
and  therefore  cannot  form  a  high-affinity  complex  with  MTX. 
The  Pro  — ►  Leu  alteration  at  position  66  that  yielded  DHFR'" 
occurred  in  the  region  of  DHFR  that  interacts  with  the  aromatic 
ring  of  MTX  [35],  and  is  likely  to  have  resulted  in  a  decreased 
affinity  of  DHFR  for  MTX.  Nonetheless,  the  test  proteins 
containing  the  DHFR'"  moiety  were  efficiently  retained  on  an 
MTX  affinity  column  (data  not  shown),  and  their  in  vivo 
degradation  could  be  specifically  inhibited  by  MTX. 
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A  model  of  targeting  by  the  N-end  rule  pathway  that  accounts 
for  our  findings  with  the  heat-inducible  N-degron  assumes  that 
the  main  cause  of  metabolic  stability  of  Arg-DHFR^'*  at  23  is 
not  the  absence  of  a  sterically  accessible  Lys  residue  within 
the  folded  DHFR  globule,  but  rather  a  steric  inaccessibility  of 
the  N-temiinal  Arg  residue.  In  this  model,  the  activation  of  the 
previously  cryptic  N-degron  of  Arg-DHFR'''  at  37  is  caused 
by  an  increased  exposure  and  flexibility  of  a  region  bearing  the 
N-terminal  Arg,  a  transition  that  leads  to  the  binding  of  Arg  by 
the  targeting  complex  of  the  N-end  rule  pathway.  According  to 
this  interpretation,  MTX  stabilizes  Arg-DHFR’''  against  degra¬ 
dation  at  37  not  by  precluding  the  conformational  mobili¬ 
zation  of  internal  Lys  residues,  but  by  precluding  a  temperature- 
mediated  increase  in  the  exposure  of  the  N-terminal  Arg.  In 
addition  to  being  consistent  with  the  available  evidence,  this 
model  accounts  for  the  otherwise  puzzling  result  that  Arg-e^^- 
DHFR‘''-ha  (which  differs  from  Arg-DHFR^"  by  the  presence  of 
e^*^)  is  short-lived  at  both  23  °C  and  37  °C,  whereas  Arg- 
DHFR‘"  is  short-lived  only  at  37  °C.  Indeed,  the  extension 
would  result  in  a  temperature-independent  enhanced  exposure  of 
N-terminal  Arg,  a  change  that  would  be,  in  this  model,  sufficient 
for  the  temperature-independent  activity  of  the  N-degron  in  Arg- 
e^’^-DHFR’\  The  earlier  model  proposed  by  Dohmen  et  al.  [19], 
which  presumed  that  the  inactivity  of  the  N-degron  in  Arg- 
DHFR'^  at  23  was  caused  by  the  absence  of  sterically 
accessible  Lys  residues,  cannot  account  for  the  above  result. 

How  can  we  verify  the  Arg-exposure  model?  One  way  is 
suggested  by  the  existence,  in  both  yeast  and  mammals,  of  the 
enzymes  N-teiTninal  amidase  (Nt-amidase)  and  Arg-tRNA- 
prolein  transferase  (R-transferase),  which  chemically  modify 
specific  (sterically  exposed)  N-terminal  residues  of  proteins  in 
the  cytosol.  In  S.  cerevisiac\  the  ATA 7 -encoded  Nt-amidase 
deamidates  N-terminal  Asn  or  Gin;  the  A7£7-encoded  R- 
transferase  conjugates  Arg  to  N-terminal  Asp  or  Glu  (reviewed 
in  [10]).  A  test  of  the  model  would  involve  the  expression  of,  for 
example.  Asn-DHFR'^  (identical  to  Arg-DHFR*'’  except  for  the 
presence  of  N-tenninal  Asn)  in  yeast  cells  that  lack  Ubrlp,  a 
component  of  the  N-end  rule  pathway  that  recognizes  primary 
destabilizing  N-terminal  residues  such  as  Arg.  (The  absence  of 
Ubrlp  would  preclude  the  degradation  of  test  proteins.)  The 
model  predicts  that  at  23  °C  the  N-terminal  Asn  of  Asn-DHFR^" 
would  be  deamidated  inefficiently  or  not  at  all,  in  contrast  with 
what  happens  at  37  °C.  This  can  be  verified  by  isolating  Asn- 
DHFR*"  from  cells  incubated  at  either  23  or  37  and 
determining  its  N-terminal  residue.  If  the  model  is  correct,  the 
N-terminal  sequence  must  begin  largely  or  entirely  with  Asn  at 
23  °C,  but  would  become  Arg-Asp-...  at  37  °C.  (In  the  latter 
case,  the  N-terminal  Asp,  produced  from  Asn.  would  be 
arginylated  by  R-transferase.)  Experimental  verification  of  this 
model  is  under  way. 
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Summary 

We  report  the  discovery  of  a  short-lived  chaperone 
that  is  required  for  the  correct  maturation  of  the  eukar¬ 
yotic  20S  proteasome  and  is  destroyed  at  a  specific 
stage  of  the  assembly  process.  The  5.  cerev/s/ae  Umpip 
protein  is  a  component  of  proteasome  precursor  com¬ 
plexes  containing  unprocessed  p  subunits  but  is  not 
detected  in  the  mature  20S  proteasome.  Upon  the  as¬ 
sociation  of  two  precursor  complexes,  Umpip  is  en¬ 
cased  and  is  rapidly  degraded  after  the  proteolytic 
sites  in  the  interior  of  the  nascent  proteasome  are 
activated.  Cells  lacking  Umpip  exhibit  a  lack  of  coordi¬ 
nation  between  the  processing  of  p  subunits  and  pro¬ 
teasome  assembly,  resulting  in  functionally  impaired 
proteasomes.  We  also  show  that  the  propeptide  of 
the  Pre2p/Doa3p  p  subunit  is  required  for  Umpip's 
function  in  proteasome  maturation. 

Introduction 

Ubiquitin  (Ub)-dependent  proteolysis  underlies  the  bulk 
of  nonlysosomal  protein  degradation  in  eukaryotic  cells 
(reviewed  by  Hochstrasser,  1996;  Varshavsky,  1997).  Natu¬ 
rally  short-lived  as  well  as  damaged  or  otherwise  abnor¬ 
mal  proteins  are  recognized  by  the  Ub  system  and  are 
marked  for  degradation  by  the  attachment  of  multi-Ub 
chains.  Ubiquitylated  proteins  are  degraded  by  the  26S 
proteasome,  an  ^2000  kDa,  multisubunit,  ATP-depen- 
dent  protease  that  consists  of  the  1 9S  complex,  which  is 
required  specifically  for  the  degradation  of  ubiquitylated 
proteins,  and  an  ~700  kDa  complex,  called  the  20S 
proteasome,  which  is  the  ATP-independent  catalytic 
core  of  the  26S  proteasome  (reviewed  by  Peters,  1994; 
Coux  et  al.,  1996;  Lupas  et  al.  1997). 

The  20S  proteasome  is  universal  among  eukaryotes; 
its  structural  homologs  have  also  been  found  in  archae¬ 
ons  and  eubacteria  (reviewed  by  Coux  et  al.,  1996).  High- 
resolution  crystal  structures  have  been  reported  for  the 

^To  whom  correspondence  should  be  addressed. 


20S  proteasomes  of  the  archaeon  Thermoptasma  acido- 
philum  and  the  eukaryote  Saccharomyces  cerevisiae 
(Ldwe  et  al.,  1995;  Groll  et  al.,  1997).  The  Thermoplasma 
proteasome  contains  two  types  of  subunits,  a  and  p, 
which  form  a  hollow  cylinder  composed  of  four  hepta- 
meric  rings  in  the  configuration  a7P7P7a7.  The  yeast  20S 
proteasome  has  a  similar  structure  but  contains  14  dis¬ 
tinct  subunits,  seven  of  the  a  type  and  seven  of  the  p 
type.  The  N-terminal  threonines  of  the  p  subunits  of 
the  Thermoplasma  proteasome  act  as  nucleophiles  In 
catalyzing  the  hydrolysis  of  peptide  bonds  of  polypep¬ 
tide  substrates  (Seemuller  et  al.,  1995).  Lys-33  and  Glu- 
17  of  the  p  subunit  also  play  a  role  in  catalyzing  the 
cleavage  of  peptide  bonds.  The  14  identical  p  subunits 
of  the  Thermoplasma  proteasome  form  14  identical  ac¬ 
tive  sites,  which  catalyze  the  cleavage  of  substrates 
after  hydrophobic  amino  acid  residues  (chymotrypsln- 
like  active  sites).  These  sites  are  located  on  the  inner 
surface  of  a  central  chamber  formed  by  the  two  rings 
of  p  subunits  (Lowe  et  al.,  1995;  SeemOller  et  al.,  1995). 
P  subunits  are  synthesized  as  inactive  precursors  con¬ 
taining  a  propeptide  that  Is  thought  to  be  cleaved  off 
autocatal^lcally,  yielding  the  mature  p  subunit  bearing 
N-terminal  Thr.  The  propeptides  of  p  subunits  are  not 
required  for  the  in  vitro  assembly  of  the  Thermoplasma 
proteasome  (Seemuller  et  al.,  1996). 

In  eukaryotic  proteasomes,  only  3  of  the  7  distinct  p 
subunits  contain  the  three  conserved  residues  required 
for  activity  of  the  Thermoplasma  proteasome.  Genetic 
and  structural  data  suggest  that  these  three  subunits 
provide  the  active-site  nucleophiles  for  the  three  distinct 
catalytic  activities  of  eukaryotic  proteasomes,  namely 
the  "chymotrypsin-like"  (see  above),  the  "trypsin-like" 
(cleavage  after  basic  residues),  and  the  "peptidylgluta- 
myl  peptide  hydrolyzing  (PGPH)"  activity  (cleavage  after 
acidic  residues).  In  S.  cerevisiae,  these  three  p  subunits 
are  Pre2p/Doa3p,  Pupip,  and  Pre3p  (Heinemeyer  et  al., 
1993;  Chen  and  Hochstrasser,  1996;  Arendt  and  Hoch¬ 
strasser,  1997;  Groll  et  al.,  1997;  Heinemeyer  et  al., 
1997).  In  the  human  20S  proteasome,  the  related  pro¬ 
teins  are  MB1,  8,  and  Z;  during  the  immune  response, 
these  subunits  can  be  replaced  by  their  respective  ho¬ 
mologs  LMP7,  LMP2,  and  MECL-1/LMP10  (reviewed  by 
Coux  et  al.,  1996).  In  the  case  of  LMP2  and  LMP7,  it 
has  been  demonstrated  that  these  replacements  are 
functionally  relevant  in  altering  the  specificity  of  antigen 
presentation  by  the  MHC  class  I  pathway.  The  catalyti- 
cally  active  p  subunits  of  the  eukaryotic  proteasome  are 
synthesized  with  propeptides,  similarly  to  the  p  subunit 
in  Thermoplasma  (Schmidtke  et  al.,  1 997). 

In  the  crystal  structure  of  the  S.  cerevisiae  2QS  protea¬ 
some,  the  opening  to  the  proteasome's  Interior,  formed 
by  the  outer  ring  of  the  a  subunits,  is  not  large  enough 
to  admit  even  an  unfolded  polypeptide  chain,  let  alone 
a  folded  protein  (Groll  et  al.,  1997).  The  degradation  of 
larger  substrates  requires  the  19S  complexes,  which 
attach  at  both  sides  of  the  20S  proteasome,  yielding  the 
26S  proteasome.  The  19S  complexes  contain  subunits 
that  bind  multl-Ub  chains,  at  least  one  Ub-specific  iso¬ 
peptidase  that  disassembles  these  chains,  and  several 
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ATPases  that  are  thought  to  be  involved  in  perturbing 
the  substrate  conformationally  and  guiding  it  to  the  inte¬ 
rior  of  the  20S  proteasome  (Deveraux  et  al.,  1 994;  Hoch- 
strasser  et  al.,  1995;  Jentsch  and  Schlenker,  1995;  Coux 
et  a!.,  1996). 

Studies  with  mammalian  cells  have  shown  that  the 
20S  proteasome  Is  assembled  through  a  15-16S  inter¬ 
mediate,  apparently  a  half-proteasome.  This  intermedi¬ 
ate  contains  al!  14  a  and  p  subunits,  some  of  which  are 
in  the  precursor  (propeptide-bearing)  form,  and  several 
uncharacterized  polypeptides  as  well  (Frentzel  et  a!., 
1994;  Yang  et  al.,  1995;  Nandi  et  al.,  1997;  Schmidtke 
et  al.,  1997).  Studies  on  the  yeast  20S  proteasome  have 
demonstrated  that  processing  of  proPre2p  (identical  to 
proDoaSp)  is  coupled  to  formation  of  the  20S  protea¬ 
some  from  two  half-proteasome  precursors  (Chen  and 
Hochstrasser,  1996).  Formation  of  the  active  site  capa¬ 
ble  of  autocatalytic  processing  of  proPre2p  depends  on 
the  juxtaposition  of  proPre2p  and  Prel  p  on  the  opposite 
sides  of  the  two  halves  of  the  proteasome.  This  mecha¬ 
nism  is  thought  to  prevent  activation  of  proteolytic  sites 
before  the  central  hydrolytic  chamber  has  been  sealed 
off  from  the  cytosol  through  association  of  the  two 
halves  of  the  proteasome  (Chen  and  Hochstrasser, 
1996).  The  Pre2p  propeptide  is  essential  for  the  forma¬ 
tion  of  functional  proteasomes.  Moreover,  this  propep¬ 
tide  can  operate  in  trans,  suggesting  that  it  serves  a 
chaperone -like  function  In  proteasome  maturation  (Chen 
and  Hochstrasser,  1996). 

In  the  present  work,  we  identify  Umpip,  a  novel  pro¬ 
tein,  as  a  component  of  a  precursor  complex  of  the  20S 
proteasome.  This  precursor  contains  unprocessed  p 
subunits.  Upon  formation  of  the  20S  proteasome  from 
two  such  precursors,  the  propeptides  of  p  subunits  are 
removed,  a  process  that  is  accompanied  by  removal  of 
Umpip  from  the  proteasome  through  Umpip  degrada¬ 
tion,  which  requires  the  proteasome's  proteolytic  activ¬ 
ity.  In  umplA  cells,  which  lack  Umpip,  coordination  of 
20S  complex  formation  and  processing  of  p  subunits  is 
impaired,  resulting  in  incompletely  or  (in  the  case  of 
Pre2p)  prematurely  processed  p  subunits.  These  find¬ 
ings  reveal  Umpip  as  a  novel  type  of  molecular  chaper¬ 
one,  a  short-lived  maturation  factor  required  for  the 
efficient  biogenesis  of  the  20S  proteasome.  We  also 
describe  genetic  evidence  that  the  propeptide  of  pro- 
Pre2p  is  required  for  Umpip-dependent  proteasome 
maturation,  and  we  present  a  model  that  accounts  for 
some  of  the  functions  of  Umpip  and  propeptides  in 
proteasome  maturation. 

Results 

The  UMP1  Gene  Is  Required  for 
Ubiquitin-Mediated  Proteolysis 
To  identify  the  genes  required  for  Ub-dependent  prote¬ 
olysis  In  S.  cerevisiae,  we  screened  for  mutants  defec¬ 
tive  in  the  degradation  of  test  substrates.  One  such 
mutant,  termed  ump1-1  (ub-mediated  proteolysis),  ex¬ 
hibited  defects  in  the  degradation  of  several  normally 
short-lived  proteins  (see  below).  The  complementing 
UMP1  gene  encoded  a  148-residue  (16.8  kDa)  protein. 
Searches  of  the  databases  did  not  identify  close  homo¬ 
logs  of  Umpip  but  did  detect  similarities  to  regions  of 


several  proteins.  One  intriguing  similarity  was  between 
Umpip  and  C-terminal  regions  of  the  protease  inhibitor 
contrapsin  (25%  identity,  52%  similarity)  and  related 
proteins  (Figure  1A),  consistent  with  the  likely  role  of 
Umpip  as  an  inhibitor  of  premature  proteolytic  acti¬ 
vation  of  the  proteasome  precursor  complexes  (see 
below). 

Pulse-chase  analyses  in  ump7A  mutants  revealed  a 
strong  stabilization  of  several  normally  short-lived  test 
proteins,  in  particular  an  N-end  rule  substrate  Arg-p- 
galactosidase  (Arg-p-gal),  which  bears  N-terminal  Arg, 
a  destabilizing  residue  (Varshavsky,  1996),  Ub-Pro-p- 
gal  (a  substrate  of  the  UFD  pathway;  Johnson  et  al., 
1995;  Varshavsky,  1997),  and  a2degi-p-gal  (a  substrate  of 
the  DOA  pathway;  Hochstrasser  and  Varshavsky,  1990; 
Hochstrasser  et  al.,  1995)  (Figure  IB).  The  recognition 
and  ubiquitylation  of  these  substrates  involve  different 
recognins  (E3  proteins)  and  different  Ub-conjugating 
(E2)  enzymes.  The  pleiotropic  character  of  the  ump7A 
phenotype  suggested  that  Umpip  functions  downstream 
of  the  recognition  and  ubiquitylation  components  of  the 
Ub  system,  perhaps  at  the  step  of  proteolysis,  or  in 
regulating  the  supply  of  Ub. 

ump1\  Mutants  Are  Hypersensitive  to  a  Variety 
of  Stresses,  Accumulate  Ub-Protein 
Conjugates,  and  Do  Not  Sporulate 
ump7A  mutants  grew  more  slowly  than  congenic  wild- 
type  (wt)  cells  at  SOX  or  lower  temperatures  and  were 
severely  growth-impaired  at  higher  temperatures  (37°C) 
(Figure  1C  and  data  not  shown).  In  addition,  they  were 
hypersensitive  to  cadmium  ions  and  to  the  arginine  ana¬ 
log  canavanine  (Figure  1C).  Similar  phenotypes  have 
been  reported  for  mutants  in  genes  encoding  E2  (Ubc) 
enzymes  or  proteasome  subunits  (Heinemeyer  et  al., 
1993;  Jungmann  et  al.,  1993).  Comparisons  of  proteins 
from  whole-cell  extracts  of  wt  and  umpita  cells  by  immu- 
noblotting  with  anti-Ub  antibodies  showed  a  dramatic 
accumulation  of  Ub-protein  conjugates  in  the  ump1L 
mutant.  At  the  same  time,  the  level  of  free  Ub  was  re¬ 
duced  in  umplA  cells  (data  not  shown).  Thus,  the  pri¬ 
mary  cause  of  the  umpl^  phenotype  appears  to  be  the 
impaired  ability  of  ump1A  cells  to  degrade  Ub-protein 
conjugates.  Homozygous  umplA  diploids  (strain  JD61) 
were  unable  to  sporulate  (data  not  shown). 

Umpip  Is  a  Component  of  Proteasome  Precursors 
To  investigate  Umpi  p  biochemically,  an  epitope  tag  was 
linked  to  Its  C  terminus.  A  single  copy  of  the  modified 
UMPI  gene  {UMP1-ha),  expressed  from  its  natural  pro¬ 
moter  and  chromosomal  location,  restored  wt  growth 
rates.  We  analyzed  whole-cell  extracts  of  a  strain  ex¬ 
pressing  Umpip-ha  and  Prelp-ha  (the  latter  an  ha- 
tagged  p  subunit  of  the  proteasome)  by  gel  filtration  on 
Superose-6.  Umpip-ha  eluted  in  fractions  22  and  23, 
both  of  which  also  contained  Prel  p-ha  and  other  protea - 
somal  subunits  (Figures  2A,  2B,  and  3A)  but  lacked  the 
chymotrypsin-like  activity  of  the  mature  20S  protea¬ 
some.  The  apparent  size  of  the  Umpi  p-containing  com¬ 
plex  was  300-400  kDa  (Figure  2A).  20S  and  265  protea¬ 
somes,  as  well  as  free  195  caps,  and  205  proteasomes 
with  one  attached  195  cap,  eluted  in  earlier  fractions 
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Figure  1.  Umpip  Is  Required  for  Ubiquitin- 
Mediated  Proteolysis 

(A)  Sequence  similarities  between  Umpip 
and  C-terminal  parts  of  mouse  contrapsin, 
human  alphal  antitrypsin  inhibitor  (alact)  and 
mouse  spi-2  (PIR  accession  numbers,  re¬ 
spectively,  JX0129,  A90475,  and  S31305). 
The  sequences  were  aligned  using  the  PileUp 
program  (GCG  package,  version  7.2,  Genet¬ 
ics  Computer  Group,  Madison,  Wl).  Gaps 
(indicated  by  hyphens)  were  used  to  max¬ 
imize  alignments.  Residues  identical  between 
Umpi  p  and  at  least  one  of  the  other  proteins 
are  shaded  in  gray.  Residues  identical  among 
at  least  two  proteins  other  than  Umpip  are 
boxed. 

(B)  Pulse-chase  analysis  comparing  the  met¬ 
abolic  stabilities  of  R-p-gal,  Ub-P-p-gal,  and 
MATa2,^-p-gal  in  wt  (UMPI)  and  umpl^ 
cells.  The  open-ended  brackets  denote  the 
position  of  multiubiquitylated  p-gal  species. 
Asterisks  denote  an  kDa  p-gal  cleavage 
product  characteristic  of  short-lived  p-gal  de¬ 
rivatives  (Dohmen  et  al.,  1991). 

(C)  Growth  of  umpIL  (JD59),  UMPI  (JD47- 
13C),  and  ump1^  cells  transformed  with 
YCpSO- UMPT  (expressing  wild-type  UMPT). 
Cells  were  streaked  on  YPD  plates  and  Incu¬ 
bated  for  2  days  at  30^  (or  at  37®C  where 
indicated),  with  0.8  pLg/ml  canavanine  or  30 
p,m  CdClz  where  indicated. 


(15-21),  as  determined  by  nondenaturing  gel  electro¬ 
phoresis  (Figure  2B  and  data  not  shown),  by  assays  of 
the  proteasome's  proteolytic  activity,  and  by  immuno¬ 
logical  detection  of  Cim3p,  a  subunit  specific  for  the 
19S  cap  of  the  26S  proteasome  (Ghislain  et  al.,  1993). 

Cochromatography  of  Umpi -ha  and  Prelp-ha  in  frac¬ 
tions  22  and  23  (Figure  2A)  suggested  that  Umpip  is  a 
component  of  proteasome  precursor  complexes.  Con¬ 
sistent  with  this  possibility,  the  Umpi  p-containing  com¬ 
plex  had  a  higher  mobility  than  the  20S  proteasome 
upon  nondenaturing  gel  electrophoresis.  A  proteasome 
precursor  with  an  electrophoretic  mobility  indistinguish¬ 
able  from  that  of  the  Umpi  p-containing  complex  was 
also  detected  in  extracts  from  a  strain  that  expressed 
Pupip-ha,  another  p  subunit  of  the  20S  proteasome 
(Figure  2B). 

To  produce  independent  evidence  bearing  on  the  na¬ 
ture  of  the  Umpi  p-containing  complex,  we  constructed 
a  strain  that  expressed  both  Umpip-ha  and  doubly 
tagged  Prelp-Flag-Hise  (Prelp-FH).  Affinity  chromatog¬ 
raphy  on  Ni-NTA-agarose  and  anti-Flag  antibody  resin 
was  then  used  to  purify  complexes  containing  Prelp- 
FH.  Umpip-ha  cofractionated  with  Prelp-FH  in  both 
affinity  purification  steps  (Figures  2C  and  2D),  confirm¬ 
ing  that  Umpi  p  is  a  component  of  a  distinct  proteasome- 
related  complex.  This  complex  sedimenteci  at  ^15S  in 
sucrose  gradients  (data  not  shown).  Taken  together, 


this  evidence  strongly  suggested  that  the  Umpi  p-con¬ 
taining  complex  is  a  precursor  of  the  20$  proteasome 
that  Is  similar  to  the  15-163  precursors  observed  in  the 
maturation  pathway  of  mammalian  proteasomes  (Frent- 
zel  et  a!.,  1994;  Yang  et  al.,  1995). 


The  Umpip  Proteasome  Precursor  Complex 
Contains  Unprocessed  p  Subunits 
Several  subunits  of  the  proteasome  are  synthesized  as 
precursors  containing  N-terminal  extensions  (propep¬ 
tides)  that  are  absent  from  the  mature  proteasome. 
These  precursors  are  detected  in  the  mammalian  15-16S 
complex  (Frentzel  et  al.,  1994;  Yang  et  a!.,  1995).  To 
compare  the  Umpip-contalning  complex  with  the  mam¬ 
malian  15-16S  complex,  we  analyzed  the  former  for  the 
presence  of  propeptide-containing  proteasomal  sub¬ 
units.  These  p  subunits  were  modified  by  the  addition  of 
C-terminal  ha  tags.  Strains  expressing  Pupl  p-ha,  Pre2p- 
ha,  and  Pre3p-ha  grew  at  wt  rates  (data  not  shown).  When 
extracts  of  these  strains  were  fractionated  by  gel  filtra¬ 
tion  on  Superose-6,  the  fractions  containing  the  Umpip 
complex  contained  largely  the  p  subunit  precursors, 
proPupl-ha,  proPre2p-ha,  and  proPre3p-ha  (Figure  3A). 
The  corresponding  mature  p  subunits  largely  eluted  in 
the  earlier  fractions  that  contained  20$  and  26S  protea¬ 
somes  (Figure  3A).  Subtle  but  reproducible  differences 
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Figure  2.  Umpi  p  Is  a  Component  of  Proteasome  Precursor  Com¬ 
plexes 

(A)  An  extract  from  strain  JD127  expressing  Umpip-ha  and  Prelp- 
ha  instead  of  wt  versions  of  these  proteins  was  fractionated  by 
gel  filtration  on  Superose-6.  The  upper  panel  shows  the  results  of 
measurements  of  chymotrypsin-like  activity  in  the  relevant  fractions 
(12-29).  The  last  fraction,  representing  the  void  volume  (Vo),  and 
positions  of  the  peaks  of  marker  proteins  are  indicated  by  dashed 
lines.  The  same  fractions  were  analyzed  by  SDS-PAGE  and  immu- 
noblotting  with  anti-Cim3p  and  anti-ha  antibodies,  which  detected 
Cim3p,  Prelp-ha,  and  Umpip-ha,  as  indicated. 

(B)  Analysis  of  Superose-fractionated  20S  proteasomes  and  precur¬ 
sor  complexes  by  nondenaturing  gel  electrophoresis  and  immu- 
noblotting  with  anti-ha  antibody.  The  strains  were  JD139  and  JD129 
expressing,  respectively,  Pupip-ha  and  Umpip-ha.  Asterisk,  20S 
proteasome  with  one  attached  19S  regulator  cap.  The  26S  protea¬ 
some  eluted  in  fractions  14-17  of  Superose-6  (not  shown). 

(C)  Affinity  purification  of  Prelp-FH  on  Ni-(NTA)-Sepharose  and  co- 
purification  of  Umpip-ha.  Purification  of  proteins  from  an  extract 
of  strain  JD126  expressing  Umpip-ha  and  Prelp-FH  was  carried 
out  as  described  in  the  Experimental  Procedures.  Different  fractions 
were  analyzed  by  SDS-PAGE  and  immunoblotting  with  anti-Flag 
(upper  paneO  and  anti-ha  antibodies  (lower  panel).  Ce,  crude  extract; 
ft,  flow-through.  In  contrast,  no  binding  of  Umpip  to  the  resin  was 
observed  when  extracts  from  strain  JD129  expressing  untagged 
Prelp  were  used  (not  shown). 

(D)  Affinity  purification  of  Prelp-FH  on  anti-Flag  antibody  agarose 
resin.  Material  that  was  eluted  from  the  Ni-(NTA)-Sepharose  column 
shown  in  panel  C  was  subjected  to  a  second  affinity  purification 
on  anti-Flag  antibody  resin.  Prelp-FH  was  specifically  eluted  with 
Increasing  concentrations  of  the  Flag  peptide.  W,  wash. 


were  detected  among  the  tagged  p  subunits  with  re¬ 
spect  to  the  distribution  of  their  precursor  and  pro¬ 
cessed  forms  in  various  fractions.  For  example,  pro- 
Pup1p-ha  was  detectable  only  in  fractions  22  and  23, 
and  was  absent  from  fraction  21 .  By  contrast,  proPre2p- 
ha  was  detectable  In  the  larger  complexes  (down  to 
fraction  19),  while  fractions  22  and  23  already  contained 
some  processed  Pre3p-ha.  These  patterns  suggested 
a  defined  order  of  processing  events.  One  conclusion 
from  these  experiments  is  that  the  Umpip-containing 
complex  Is  a  precursor  of  the  20S  proteasome  that  con¬ 
tains  unprocessed  p  subunits  and  is  therefore  proteolyti- 
cally  inactive  (Figures  3A  and  30). 

Umpip  Is  Required  for  Correct 
Proteasome  Maturation  * 

Next,  we  asked  what  effect  the  ump1A  mutation  has  on 
maturation  and  activity  of  the  proteasome.  Specifically, 
the  analyses  described  above  were  repeated  with  ex¬ 
tracts  obtained  from  ump1A  cells.  Figure  30  shows  that 
there  was  a  significant  reduction  In  the  three  proteolytic 
activities  of  the  proteasome  in  the  fractions  containing 
the  20S  and  26S  proteasomes  (fractions  14-21).  The 
reduction  of  the  specific  activity  appears  to  be  partially 
compensated  by  an  increased  expression  of  protea¬ 
somes  (Figures  3A  and  3B,  data  not  shown)  similar  to 
that  observed  in  ceils  expressing  mutant  p  subunits 
(Arendt  and  Hochstrasser,  1997).  In  addition,  a  new  (ab¬ 
sent  from  wt  cells)  peak  of  chymotrypsin-like  activity 
encompassing  fractions  22  and  23  was  detected  in  ex¬ 
tracts  from  umplA  cells.  These  were  the  fractions  that 
contained  the  Umpip  complex  from  wt  extracts  (see 
above  and  Figure  3A),  suggesting  that  the  absence  of 
Umpip  from  the  proteasome  precursor  complex  was 
the  cause  for  premature  activation  of  its  chymotryptic 
activity  in  the  ump1A  mutant.  To  verify  that  the  detected 
activity  was  actually  of  the  ~15S  precursor  complex 
from  umpl^  cells,  the  fraction  22  samples  from  strains 
expressing  either  wt  Pre2p  or  Pre2p-ha  were  incubated 
with  anti-ha  antibody  and  protein  A  Sepharose.  This 
treatment  did  not  deplete  the  chymotryptic  activity  from 
the  sample  containing  untagged  Pre2p  but  did  deplete 
^80%  of  the  activity  from  an  otherwise  identical  sample 
derived  from  ump1A  cells  expressing  Pre2p-ha  (data  not 
shown).  Thus,  the  chymotryptic  activity  of  this  fraction 
resided  in  the  proteasome  precursor  complexes. 

To  follow  the  appearance  of  the  precursor  and  mature 
forms  of  p  subunits  in  proteasome  precursors  and  ma¬ 
ture  proteasomes,  we  analyzed  extracts  from  umpi  A 
cells  expressing  ha -tagged  versions  of  these  subunits 
by  immunoblotting  with  anti-ha  antibody.  The  process¬ 
ing  of  the  three  analyzed  p  subunits,  Pupip-ha,  Pre2p- 
ha,  and  Pre3p-ha,  was  strikingly  different  in  umplA  and 
wt  cells  (Figure  3A).  For  all  three  subunits,  a  dramatic 
increase  of  their  precursors  was  detected  in  the  frac¬ 
tions  (14-21)  that  contained  the  20S  and  26S  forms  of  the 
proteasome.  In  addition,  different  processed  variants, 
possibly  representing  processing  intermediates  of  pro- 
Pre2p,  could  be  detected  in  fractions  22  and  23,  which 
contained  the  M  5S  proteasome  precursor,  and  In  some 
of  the  proteasome-containing  fractions  as  well  (Figure 
3A).  In  contrast,  with  the  exception  of  Pre3p,  almost 
no  processed  p  subunits  were  present  in  the  fractions 
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Figure  3.  Ump1p  Is  Required  for  the  Correct 
Processing  of  Proteasomal  p  Subunits 

(A)  SDS-PAGE  and  anti-ha  immunoblot  analy¬ 
ses  of  extracts  fractionated  on  Superose-6. 
The  congenic  yeast  strains  used  were  JD1 31 , 
JD132,  JD133,  JD134,  JD135,  and  JD136 
(Table  1).  Western  blots  of  the  umplA  ex¬ 
tracts  were  developed  for  a  shorter  time  to 
compensate  for  an  increased  expression  of 
proteasomal  subunits  in  the  mutant.  Aster¬ 
isks,  processing  intermediates  or  incom¬ 
pletely  processed  forms  of  Pre2p-ha  that  were 
observed  in  the  umplA  mutant  but  not  in  wt 
cells. 

(B)  Pulse-chase  analysis  comparing  the  rate 
of  p  subunit  processing  in  wt  cells  to  that  in 
umplA  cells  (same  strains  as  in  A).  Cells  were 
pulse-labeled  with  “S-Met/Cys  for  5  min.  Pro¬ 
teins  were  precipitated  with  anti-ha  antibody. 
Fluorographic  exposures  were  ^^3  times 
longer  for  the  UMP1  samples.  P,  propeptide- 
containing  precursor  form;  m,  mature  form; 
asterisks,  as  in  (A);  dots,  nonspecific  bands 
present  in  some  samples. 

(C)  Comparison  of  proteolytic  activities  in  ex¬ 
tracts  with  equal  amounts  of  total  protein  of 
UMP1  (JD127)  and  umplA  cells  (JD75)  frac¬ 
tionated  by  gel  filtration  on  Superose-6.  The 
profile  of  the  chymotrypsin-like  activity  In 
UMP1  is  the  same  as  in  Figure  2A. 


containing  the  ~15S  proteasome  precursor  from  wt 
cells,  and  no  processing  intermediates  of  Pre2p  could 
be  detected  either  (Figure  3A).  These  findings  indicate 
that  Umpi  p  is  required  to  prevent  premature  processing 
of  at  least  proPre2p  and  that  the  presence  of  Ump1p  is 
important  for  the  coordination  of  proteasome  assembly 
and  subunit  processing. 

A  defect  of  ump  7A  cells  in  the  processing  of  p  subunits 
was  also  observed  by  following  the  metabolic  fate  of 
these  subunits  in  pulse-chase  experiments.  In  wt  cells, 
most  of  the  propeptide-containing  forms  of  p  subunits 
(proPupl-ha,  proPre2p-ha  and  proPre3p-ha)  were  con¬ 
verted  into  their  mature  counterparts  during  the  30-min 
chase  (Figure  3B).  By  contrast,  in  the  umplA  mutant,  the 
bulk  of  the  p  subunit  precursors  remained  unprocessed 
during  this  time,  a  finding  consistent  with  the  results  of 
immunoblot  analyses  (Figure  3A).  In  the  case  of  Pre2p- 
ha,  pulse-chase  analysis  again  revealed  the  species  of 
intermediate  size  (putative  processing  intermediates)  in 
extracts  from  umplA  cells.  These  species  were  absent 
from  the  equivalent  samples  derived  from  wt  cells  (Fig¬ 
ure  3B). 

Umpip  Is  Degraded  upon  Formation 
of  the  20S  Proteasome 

The  experiments  above  demonstrated  that  Umpip  is  a 
component  of  proteasome  precursors  that  is  absent 
from  the  mature  20S  proteasome.  One  possibility  was 


that  Umpip  leaves  the  precursor  complex  upon  forma¬ 
tion  of  the  20S  proteasome  from  the  two  ^15S  precur¬ 
sors  and  may  then  assist  with  another  round  of  protea¬ 
some  assembly,  thus  acting  catalyticaliy.  It  was  also 
possible  that  formation  of  the  20S  proteasome  sterically 
traps  Umpip.  Since  the  formation  of  the  20S  form  of 
the  proteasome  coincides  with  the  appearance  of  its 
proteolytic  activities  (Frentzel  et  al.,  1994;  Chen  and 
Hochstrasser,  1996),  Umpip  might  be  degraded  by  the 
newly  formed  20S  proteasome.  Pulse-chase  analysis  of 
Umpi  p-ha  in  wt  cells  showed  that  Umpi  p-ha  is  indeed 
degraded  in  vivo,  the  rate  of  its  degradation  being  similar 
to  the  rate  of  disappearance  of  proPupIp-ha,  which  is 
converted  to  Pupip-ha  upon  maturation  of  the  protea¬ 
some  (compare  Figures  4A  and  3B). 


Umpip  Is  Stabilized  in  Proteasome  Mutants  and 
Can  Be  Detected  Inside  the  20S  Particle 
If  Umpip  becomes  a  substrate  of  the  newly  formed 
20S  proteasome,  the  degradation  of  Umpip  should  be 
inhibited  by  mutations  that  affect  the  proteasome's  pro¬ 
teolytic  activities.  We  used  pulse-chase  assays  to  follow 
the  metabolic  fate  of  Umpi  p-ha  in  the  pre7-7  mutant, 
which  is  known  to  be  deficient  in  the  chymotryptic  activ¬ 
ity  of  the  proteasome  (Heinemeyer  et  al.,  1993).  Umpip 
was  partially  stabilized  In  pre7-7  cells,  as  indicated  by 
the  increased  amount  of  ^®S-labeled  Umpip  in  preUI 
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extracts  versus  wt  extracts  that  contained  the  same  total 
amount  of  TCA-precipitable  (Figure  4A).  However,  we 
still  observed  a  significant  decrease  of  pulse-labeled 
Umpi  p  upon  increasing  chase  times  when  the  immuno- 
precipitation  was  carried  out  under  nondenaturing  con¬ 
ditions  (Figure  4A).  By  contrast,  when  the  extracts  were 
treated  with  0.4%  SDS  at  100°C  before  immunoprecipi- 
tation,  virtually  no  decrease  of  the  Umpip  signal  during 
the  chase  was  observed  in  pre7-7  cells.  These  results 
were  consistent  with  the  possibility  that  the  newly 
formed  Umpip-ha  became  Inaccessible  to  the  anti-ha 
antibody  during  the  chase  because  It  became  trapped 
within  the  newly  formed  pre7-7  proteasome.  If  so, 
Umpip  was  expected  to  be  present  in  fractions  from 
the  Superose-6  column  that  corresponded  to  the  20S 
and  26S  proteasomes  in  the  pre7-7  mutant.  Indeed, 
whereas  in  wt  (PRE1)  cells  Umpip-ha  was  detected  by 
SDS-PAGE  and  immunoblotting only  infractions  22  and 
23  (corresponding  to  the  ~15S  proteasome  precursor 
complex),  in  the  mutant  (pre7-7)  cells  Umpip-ha  was 
detected  in  fractions  15-23,  indicating  that  it  was  also 
present  in  mature  proteasomes  (Figure  4B).  We  assayed 
the  accessibility  of  Umpip-ha  to  antl-ha  antibody  by 
immunoprecipitation  from  fraction  19  (the  205  protea¬ 
some)  of  extracts  from  the  pre7  -  7  mutant  and  from  frac¬ 
tion  22  (the  ~1 55  proteasome  precursor)  of  extract  from 
PRE1  cells.  Umpip-ha  could  be  immunoprecipitated 
from  either  fraction  19  or  22  following  pretreatment  with 
0.1  %  5D5,  but  not  under  nondenaturing  conditions  (Fig¬ 
ure  4C),  indicating  that  the  ha  tag  was  inaccessible  to 
the  anti-ha  antibody  in  both  the  pre7-7  proteasome  and 
the  precursor  complex.  However,  when  otherwise  iden¬ 
tical  assays  were  carried  out  with  a  polyclonal  anti- 
Umpl  p  antiserum,  most  of  Umpi  p  could  be  immunopre¬ 
cipitated  under  nondenaturing  conditions  from  fraction 
22  (the  ~155  proteasome  precursor)  but  was  still  not 
immunoprecipitated  from  fraction  19  (the  205  pre7-7 
proteasome).  In  contrast,  when  polyclonal  anti-prolea- 
some  antibodies  were  used,  Umpip  was  precipitated 
quantitatively  along  with  both  complexes  (Rgure  4C). 

These  results  indicated  that  a  part  of  Umpip  other 
than  its  C  terminus  is  accessible  to  anti-UmpI  p  antibod¬ 
ies  in  the  ~155  precursor.  However,  Umpip  becomes 
entirely  inaccessible  (under  nondenaturing  conditions) 
upon  formation  of  the  205  proteasome  from  two  ^155 
precursors.  This  interpretation  was  supported  by  exam¬ 
ining  the  sensitivity  of  Umpi  p-ha  in  different  complexes 
to  trypsin  digestion  (Figure  4D).  5pecifically,  Umpip-ha 
in  the  205  proteasomes  (fraction  19)  from  pre7-7  cells 
was  completely  protected  against  trypsin.  In  contrast, 
Umpip-ha  in  the  ~155  proteasome  precursors  was  de- 
tectably  accessible  to  trypsin.  The  degradation  of  Umpip 
was  incomplete,  resulting  in  a  protected  fragment  of 
Umpi  p-ha  that  lacked  the  -^5  kDa  N-terminal  region  but 
retained  the  C-terminal  ha  tag  (Figure  4D).  Interestingly, 
in  a  similar  experiment  with  a  strain  expressing  pro- 
Pre2p-ha,  we  observed  that,  in  the  Umpi -containing 
~15S  proteasome  precursor  complex  (fraction  22),  pro- 
Pre2p-ha  was  shortened  by  trypsin  treatment  to  yield  a 
product  whose  electrophoretic  mobility  was  indistin¬ 
guishable  from  that  of  the  natural  mature  Pre2p-ha  (Fig¬ 
ure  4D).  In  these  experiments,  the  overall  structure  of 
the  Umpip  proteasome  precursor  complex  remained 
intact  as  judged  by  native  gel  analysis  of  trypsin-treated 
material  (data  not  shown). 


PRB1  pml-l 


0  10  20  30  0  10  20  30 

chase  time  (min) 


Umpip-ha 

fraction  12131415  16  17  18  19  20  21  22  23  2425  26272829 


PRE1 


preUI 


C 


prel-l 
frat^ion  19 

PRB1 
fraction  22 


Umpip-ha 


-SDS 

+  SDS 

-  SDS  +  SDS 

S  P 

S  P 

S  P  S  P 

If-  ■  : 

IP  antl-ha  anti-Ump1p 


-SDS 

P 

a 


D 


Umpip-ha  Pre2p-ha 

pref-f  PRE1  PRE1 

fraction  19  fraction  22  fraction  22 


0  0.2  0.4  0.6  1.0  0  0.2  0.4  0.6  1.0  0  0.2 


trypsin  (pg/ml) 


Figure  4.  Umpip  Is  Stabilized  in  pre1-1  Mutants  Cells  that  Are  De¬ 
fective  in  the  Proteasome's  Chymotrypsln-like  Activity  and  Persists 
in  20S  Proteasomes 

(A)  Pulse-chase  analysis  of  Umpip-ha  in  wt  and  pre7-7  cells.  The 
strains  used  were  JD150  (PREf)  and  JD151  (pre7-7)  (Table  1),  both 
expressing  Umpi  p-ha.  The  cells  were  pulse-labeled  with  ^S-Met/ 
Cys  for  5  min.  Extracts  were  prepared  from  samples  taken  at  differ¬ 
ent  chase  times.  Samples  were  then  split  in  halves.  One-half  SDS) 
was  subjected  to  immunoprecipitation  with  anti-ha  antibody  ac¬ 
cording  to  the  standard  pulse-chase  protocol.  The  other  half  (+SDS) 
was  adjusted  to  0.4%  SDS  and  incubated  at  100®C  for  5  min,  then 
diluted  with  extraction  buffer  to  the  final  concentration  of  0.1  %  SDS 
prior  to  immunoprecipitation. 

(B-D)  SDS-PAGE  and  anti-ha  immunoblot  analyses.  (B)  Detection 
of  Umpip-ha-containing  complexes  in  extracts  fractionated  by  Su¬ 
perose-6  gel  filtration.  Strains  were  the  same  as  in  (A).  Note  the 
accumulation  of  Umpip-ha  in  fractions  containing  the  20S  and 
26S  proteasome  (fractions  15-21)  in  pre7-7.  (C)  Immunoblot  analy¬ 
ses  of  immunoprecipitations  were  carried  out  with  material  from 
Superose-6  fraction  19  of  extracts  from  strain  JD151  and  fraction 
22  of  extracts  from  strain  JD150,  using  the  indicated  antibodies. 
Immunoprecipitations  were  performed  without  addition  of  SDS 
(-SDS)  or  after  boiling  in  the  presence  of  0.1%  SDS  (+SDS).  P, 
precipitates;  S,  supernatants.  (D)  Assaying  trypsin  sensitivity  of  the 
same  material  as  in  (C),  and  of  the  Superose-6  fraction  22,  from  strain 
JD138  expressing  Pre2p-ha.  P,  proPre2p-ha:  m,  mature  Pre2p-ha; 
asterisks  mark  trypsin  digestion  products. 


In  the  umpi  A  Mutant,  the  Propeptide  of  Pre2p 
Is  Not  Required  for  Incorporation 
of  Pre2p  into  the  Proteasome 
Chen  and  Hochstrasser  (1996)  have  elegantly  demon¬ 
strated  that  the  propeptide  of  Pre2p,  if  separated  from 
the  mature  Pre2p,  could  function  in  trans  and  thereby 
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Figure  5.  The  Pre2p  Propeptide  Is  Not  Essential  in  the  ump^  Mutant 

(A)  Representation  of  different  genotypes  of  a  set  of  congenic  strains 
and  their  effects  on  cell  viability.  Strain  1  (JD50-1  OB)  is  the  wt  control. 
Strain  2  (MHY952)  expresses  Pre2p/Doa3p-ALS  with  N-terminal  Thr 
ni  as  a  fusion  to  ubiquitin  (not  shown),  and  the  Pre2p/Doa3p  propep¬ 
tide  (LS)from  two  separate  plasmids  (Chen  andHochstrasser,  1996). 
Strain  3  (JD160),  same  as  strain  2  but  ump1^.  Strain  4  would  be 
derived  from  strain  2  through  the  loss  of  the  plasmid  encoding 
Pre2p/Doa3p-LS  that  operates  in  trans.  We  were  unable  to  produce 
isolates  that  lost  this  plasmid,  confirming  the  earlier  demonstration 
that  the  propeptide  is  essential  for  cell  viability  (Chen  and  Hochstras- 
ser,  1996).  Strain  5  (JD163)  Is  derived  from  strain  3  through  the  loss 
of  the  propeptide-encoding  plasmid.  Loss  of  the  plasmid  did  not 
affect  cell  viability  in  the  umplA  background. 

(B)  Induction  of  UMP1  expression  in  the  ump^^  background  Inhibits 
the  growth  of  a  strain  that  lacks  the  Pre2p  propeptide.  Strains  3 
(plus)  and  5  (minus)  shown  in  (A)  were  transformed  with  pJD429  that 
expressed  UMP1  from  the  galactose-inducible,  glucose-repressible 
Pgau  promoter,  or  with  an  empty  vector  as  a  control.  Transformants 
selected  on  glucose  media  were  pregrown  on  selective  media  con¬ 
taining  raffinose  as  a  carbon  source,  then  streaked  onto  selective 
media  with  2%  galactose  or  2%  glucose,  and  grown  for  3  days  at 
30X. 

Still  allow  the  incorporation  of  Pre2p  into  the  20S  protea¬ 
some.  Under  these  conditions,  expression  of  the  pro¬ 
peptide  is  essential  for  cell  viability,  suggesting  that  the 
propeptide,  in  c/s  or  at  least  in  trans,  is  required  for  the 
assembly  of  an  active  proteasome  (Chen  and  Hochstras- 
ser,  1996). 

We  confirmed  this  result  using  their  strain  MHY952, 
in  which  mature  Pre2pandthe  propeptide  are  expressed 
on  separate  plasmids  (Figure  5A).  Specifically,  under 
nonselective  growth  conditions  this  strain  did  not  yield 
viable  cells  that  had  lost  the  plasmid  expressing  the 


propeptide.  Surprisingly,  however,  when  we  constructed 
and  examined  a  congenic  umplA  derivative  of  this 
strain,  we  noticed  that  it  lost  the  propeptide-expressing 
plasmid  at  a  frequency  suggesting  that  this  plasmid  did 
not  provide  a  significant  growth  advantage  to  cells  (data 
not  shown).  Indeed,  growth  rates  of  cells  with  the  plas¬ 
mid  were  indistinguishable  from  those  lacking  it  (Figure 
5B).  This  result  demonstrated  that  ump1A  is  a  suppres¬ 
sor  of  a  deletion  of  the  propeptide  of  Pre2p.  If  so,  expres¬ 
sion  of  UMP1  in  the  {umplA  PRE2-£^pro)  background 
should  be  incompatible  with  cell  viability.  This  prediction 
was  confirmed  when  we  examined  growth  properties 
of  the  (ump1A  PRE2-^pro)  strain  transformed  with  a 
plasmid  expressing  UMP1  from  the  galactose-induci¬ 
ble  Pgali  promoter.  On  glucose-containing  media,  the 
growth  rate  of  this  transformant  was  indistinguishable 
from  that  of  an  otherwise  identical  transformant  carrying 
the  propeptide-expressing  plasmid.  In  contrast,  on  ga¬ 
lactose-containing  media,  the  {umplA  PRE2’Apro)  mu¬ 
tant  containing  the  Pgali- UMP1  plasmid  was  unable  to 
grow,  whereas  the  control  strains  grew  (Figure  5B).  This 
result  indicated  that  the  propeptide  of  Pre2p  is  essential 
for  the  Umpip-assisted  maturation  of  the  proteasome 
but  is  not  essential  for  (partially)  defective  maturation 
of  the  proteasome  that  takes  place  in  the  absence  of 
Umpip. 


Discussion 

Umpip,  a  Novel  Maturation  Factor 
of  the  20S  Proteasome 

We  describe  the  discovery  of  a  proteasome  maturation 
factor,  termed  Umpi  p,  whose  unique  properties  include 
a  short  in  vivo  half-life  that  Is  due  to  its  degradation 
within  the  newly  formed  proteasome.  Umpip  is  required 
for  coordination  of  the  proteasome's  physical  assembly 
and  enzymatic  activation.  In  addition,  the  normally  es¬ 
sential  propeptide  of  the  Pre2p  p  subunit  was  found 
to  become  nonessential  in  the  absence  of  Umpip.  We 
report  the  following  specific  results, 
umpi  A  Mutants  Are  Defective  in 
Ub-Mediated  Proteofysis 

They  are  sensitive  to  a  variety  of  stresses  and  accu¬ 
mulate  Ub-protein  conjugates  (Figure  1  and  data  not 
shown).  All  of  the  observed  phenotypes  of  umpi  A  cells 
are  consistent  with  the  conclusion  that  umpi  mutants 
are  impaired  in  proteasome  biogenesis  and,  conse¬ 
quently,  in  the  degradation  of  ubiquitylated  proteins. 
The  UMPI  Gene 

It  encodes  a  polypeptide  with  a  calculated  molecular 
mass  of  16.8  kDa.  No  close  sequence  homologs  of 
Umpi  p  were  detected  in  the  current  databases.  How¬ 
ever,  the  presence  of  a  small  protein  similar  in  size  to 
Umpip  in  preparations  of  the  half- proteasome  precur¬ 
sors  in  mammals  (Frentzel  et  al.,  1994;  Yang  et  al.,  1995; 
Nandi  et  al.,  1997;  Schmidtke  et  al.,  1997)  suggests  the 
presence  of  a  functional  homolog  of  Umpi  p  in  the  mam¬ 
malian  proteasome  maturation  pathway. 

Umpip  is  a  Component  of  Proteasome  Precursors 
that  Contain  Unprocessed  p  Subunits 
The  Umpip-containing  complex  has  a  molecular  mass 
of  300-400  kDa,  sediments  at  ^^15$,  migrates  signifi¬ 
cantly  faster  in  native  gels  than  the  20$  proteasome, 
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Figure  6.  A  Model  of  the  Umplp  Function  in 
Proteasome  Maturation 
Shown  is  a  schematic  view  of  the  20S  protea¬ 
some  and  its  precursor  forms,  with  the  a  and 
3  subunits  as  blue  and  green  balls,  respec¬ 
tively.  Three  of  the  ^  subunits  are  drawn  with 
extensions  that  represent  propeptides.  The  p 
subunits  in  the  front  are  drawn  transparent  in 
order  to  allow  a  view  into  the  interior  chamber 
of  the  proteasome.  Structure  A  is  a  protea¬ 
some  precursor  complex  (half  proteasome), 
characterized  by  the  presence  of  Umplp  and 
unprocessed  p  subunits.  In  step  1,  two  of 
these  precursors  join  to  build  structure  B,  a 
step  that  leads  to  conformational  or  posi¬ 
tional  shifts  of  Umplp  and  propeptides.  Con¬ 
formational  changes  of  the  propeptides  trig¬ 
ger  their  autocatalytic  processing  (step  2), 

and  activation  of  the  proteasome's  proteolytic  activities.  This  leads  to  the  degradation  of  the  chamber-entrapped  Umplp  by  the  newly  formed 
proteasome  (step  3).  Only  structures  A  and  D  are  long-lived  enough  to  be  detected  in  wt  ceils.  The  findings  with  the  pre7-7  mutant,  in  which 
the  degradation  of  Umplp  is  inhibited,  suggest  the  existence  of  the  short-lived  intermediates  B  and  C.  See  text  for  details. 
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and  has  a  subunit  composition  that  is  highly  similar  to 
that  of  the  20S  proteasome,  as  determined  by  SDS- 
PAGE  and  immunoblot  analyses  with  anti-20S  protea¬ 
some  antibodies  (Figure  2  and  data  not  shown).  Taken 
together,  these  results  indicate  that  the  Umplp  is  con¬ 
tained  within  a  half-proteasome  precursor  complex, 
whose  counterpart  has  been  described  in  the  maturation 
pathway  of  mammalian  proteasomes  (Frentzel  et  al., 
1994;  Yang  et  al.,  1995;  Nandi  et  al.,  1997;  Schmidtke 
et  al.,  1997). 

Maturation  of  Pupip,  Pre2p,  and  PreSp  Is  Strongly 
Impaired  in  the  umpi  A  Mutant 
Considerably  increased  amounts  of  the  unprocessed 
(propeptide-containing)  forms  of  these  p  subunits  were 
detected  in  the  20S  and  26S  fractions  from  ump 7A  cells 
(Figure  3).  In  addition,  the  proteasomes  assembled  in 
umpTA  cells  had  reduced  activity.  Thus,  Umplp  is  re¬ 
quired  for  the  correct  and  efficient  maturation  of  the 
proteasomes. 

Partially  Processed  Forms  ofPro-PreZp  Are  Detected 
in  the  Proteasome  Precursor  Complex  and  the 
Proteasome  of  ump1A  Cells  but  Not  in  WT  Cells 
These  incompletely  and  prematurely  processed  forms 
of  Pre2p  (Figures  3A  and  3B)  appear  to  underlie  a  chy- 
motrypsin-like  activity  that  was  associated  with  the  pre¬ 
cursor  complex  in  ump7A  cells  but  was  absent  from  the 
same  complex  of  wt  cells  (Figure  3C).  This  was  sug¬ 
gested  by  the  observation  that  this  activity  was  not  in¬ 
hibited  by  treatment  with  tactacystin  and  was  absent 
from  cells  lacking  the  Pre2p  propeptide  (P.  C.  R.  and 
R.  J.  D.,  unpublished  data).  These  findings  indicated 
that  Umplp  has  a  dual  role  in  proteasome  maturation. 
Specifically,  Umplp  prevents  premature  processing  of 
proPre2p  in  the  precursor  complex  and  Is  also  required 
for  the  correct  maturation  of  active  sites  upon  assembly 
of  the  proteasome  (see  below). 

Umplp  Is  Degraded  by  the  Newly 
Formed  Proteasome 

The  kinetics  of  the  rapid  degradation  of  Umpi  p  Is  similar 
to  the  kinetics  of  processing  of  Pupip  (Figures  3B  and 
4A),  suggesting  that  Umplp  is  destroyed  by  the  protea¬ 
some  upon  its  formation  from  the  two  half-proteasome 
precursors  and  the  accompanying  maturation  of  active 


sites  (Chen  and  Hochstrasser,  1996).  This  conclusion 
was  strongly  supported  by  the  observation  that  Umplp 
is  significantly  stabilized  in  the  pre7-7  mutant,  which  is 
deficient  in  the  proteasome's  chymotrypsin-like  activity 
(Figure  4A).  Specifically,  in  contrast  to  the  pattern  in  wt 
cells,  Umplp  was  detectable  in  the  20S  and  26S  forms 
of  the  pre7-7  proteasome.  Umplp  in  these  complexes 
was  shielded  both  from  detection  by  antibodies  and 
from  digestion  by  trypsin  (Figures  4B-4D).  These  and 
related  data  suggested  a  model  in  which  Umplp  is  en¬ 
cased  within  the  20S  proteasome  upon  Its  assembly 
from  the  two  Umpip-containing  half-proteasome  pre¬ 
cursor  complexes  (Figure  6).  Formation  of  the  20S  struc¬ 
ture  triggers  active  site  maturation,  resulting  in  the  deg¬ 
radation  of  Umplp. 

The  Propeptide  of  PreZp  Is  Not  Essential 
fnumplA  Cells 

Chen  and  Hochstrasser  (1996)  demonstrated  that  the 
propeptide  of  Pre2p  is  essential  for  cell  viability  and  that 
this  propeptide  can  operate  in  trans.  They  concluded 
that  the  propeptide,  in  addition  to  rendering  proteasome 
precursors  proteolytically  inactive,  serves  a  chaperone - 
like  function  required  for  the  correct  incorporation  of 
Pre2p  into  the  proteasome.  One  striking  result  of  the 
present  work  is  that  the  propeptide  of  Pre2p  becomes 
dispensable  in  the  umpi  A  mutant  (Figure  5).  In  fact, 
the  presence  of  the  propeptide,  which  is  essential  for 
viability  of  wt  cells,  does  not  provide  a  growth  advantage 
to  ump  7A  cells  that  express  it. 

On  the  Functions  of  Umplp  and  p  Subunit 
Propeptides  in  Proteasome  Maturation 
Our  results  suggested  a  model  illustrated  in  Figure  6. 
This  model  Is  based  in  part  on  the  idea  described  by 
Chen  and  Hochstrasser  0996)— that  the  active  sites  of 
the  proteasome  are  formed  upon  its  assembly  from  two 
half-proteasome  precursors,  through  a  juxtaposition  of 
subunits  at  the  interface  of  the  dyad-related  halves.  The 
advantage  of  this  mechanism  is  that  autocatalytic  matu¬ 
ration  of  the  active  sites  (through  processing  of  the  rele¬ 
vant  p  subunits)  is  coupled  to  the  assembly  of  the  pro¬ 
teasome,  thus  avoiding  premature  processing  of  the 
propeptides  of  p  subunits.  This  view  was  supported  by 
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Table  1.  Yeast  Strains 

Strain 

Relevant  Genotype 

Source/Comment 

YPH500 

MATa.  ade2‘101  his3’^200  Ieu2-M  Iys2-801  trp7-A63  ura3-52 

Sikorski  and  Hieter,  1989 

JDC9-11 

MATa  ump1-1 

Derivative  of  YPH500 

JD47-13C 

MATa  his3-^200  Ieu2-3,112  Iys2-801  t^p1-^63  ura3-52 

Dohmen  et  al.,  1995 

JD59 

MATa  ump1-M::HIS3 

Derivative  of  JD47-13C 

JD53 

MATa 

Derivative  of  JD47-13C 

JD81-1A 

MATa  ump1-M::HIS3 

Derivative  of  JD53 

JD75 

MATa  PRE1-ha::Ylplac211  ump1-M::HIS3 

Derivative  of  JD47-13C 

JD126 

MATa  UMP1-ha::Ylplac128  PRE1-Flag-6His::Ylplac211 

Derivative  of  JD47-13C 

JD127 

MATa  UMPUha::Ylplac128  PRE1‘ha::Ylplac211 

Derivative  of  JD71 

JD129 

MATa  UMP1-ha::Ylplac128 

Derivative  of  JD127 

JD131 

MATa  UMP1-ha::Ylplac128  PUP1-ha::Yfplac211 

Derivative  of  JD129 

JD132 

MATa  ump1-LV:HIS3  PUP1-ha::Ylplac2n 

Derivative  of  JD59 

JD133 

MATa  UMP1-ha::Ylplac128  PRE2-ha::Ylplac21 1 

Derivative  of  JD129 

JD134 

MATa  ump1-M::HIS3  PRE2-ha::Ylplac211 

Derivative  of  JD59 

JD135 

MATa  UMP1-ha::Ylplac128  PRE3-ha::Ylplac211 

'  Derivative  of  JD1 29 

JD136 

MATa  ump1-M::HIS3  PRE3-ha::Ylplac211 

Derivative  of  JD59 

JD138 

MATa  PRE2-ha::Ylplac211 

Derivative  of  JD47-13C 

JD139 

MATa  PUP1-ha::Ylplac2n 

Derivative  of  JD47-13C 

BBY45 

MATa  his3-^200  Ieu2-3,n2  Iys2-801  trp1-1  ura3’52 

Bartel  et  al.,  1990 

JD61 

MATa/MATa  umpUM::HIS3/ump1-A2::LEU2 

Congenic  with  BBY45 

JD50-10B 

MATa  Ieu2-3,112  his3-A200  trpl-1  ura3-52 

Congenic  with  BBY45 

MHY952 

MATa  Ieu2-3,112  his3-A200  trp1-1  ura3-52 

Chen  and  Hochstrasser,  1996, 

pre2(=doa3)M::HIS3  (YCpUbDOA3ALS-Hls)  (YEpDOASts) 

congenic  with  BBY45 

JD160 

MATa  ump1-A3 

Derivative  of  MHY952 

pre2(=doa3)M::HIS3  (YCpUbDOA3ALS-His)  (YEpDOASts) 

JD163 

MATa  ump1~A3 

Derivative  of  JD160, 

pre2(==  doa3)-A  1::HIS3  {YCpUbDOA3ALS-His) 

cured  of  YEpDOASus 

WCG4a 

MATa  ura3his3-n  Ieu2-3,112 

Heinemeyer  et  al„  1 993 

YHI29/1 

MATa  prel-1  ura3  his3-11  ieu2-3,n2 

Heinemeyer  et  al.,  1 993 

JD150 

MATa  UMP1-ha::Y!plac128 

Derivative  of  WCG4a 

JD151 

MATa  pre1-1  UMP1-ha::Ylplac128 

Derivative  of  YH129/1 

biochemical  analyses  of  the  proteasome  maturation  in 
mammalian  cells  (Frentzel  et  al.,  1994;  Yang  et  al.,  1995; 
Nandi  et  a!.,  1997;  Schmidtke  et  al.,  1997). 

The  results  of  the  present  work  further  support  this 
model.  In  addition,  we  discovered  that  a  novel  factor, 
Umpi  p,  is  required  for  autocatalytic  active  site  matura¬ 
tion  and  proteasome  assembly  to  occur  In  a  coordinated 
fashion.  A  model  that  accounts  for  our  findings  and  is 
consistent  with  the  available  evidence  Is  illustrated  in 
Figure  6.  In  this  model,  Umplp  has  a  chaperone-like 
function  required  for  the  efficient  processing  of  the  pro¬ 
peptides  of  p  subunits  upon  the  assembly  of  20S  protea- 
somes.  Specifically,  Umplp  interacts  with  these  p  sub¬ 
units,  most  likely  with  their  propeptides  (structure  A  in 
Figures).  Upon  formation  of  the  20S  proteasome  (Figure 
6,  step  1),  a  conformational  change  of  Umplp,  or  a 
change  of  its  position  within  the  complex,  would  induce 
a  conformational  change  of  the  propeptide  that  results 
in  autocatalytic  processing  (Figure  6,  step  2). 

Why  is  the  propeptide  of  Pre2p  essential  for  protea¬ 
some  assembly  and  activation  only  in  the  presence  of 
Umplp?  We  propose  that  the  propeptide  of  Pre2p  is 
required  to  induce  an  alteration  of  conformation  or  posi¬ 
tion  of  Umplp  upon  the  assembly  of  the  20S  particle. 
In  this  model,  the  absence  of  the  propeptide  would  leave 
Umpi  p  in  a  position  that  is  incompatible  with  the  forma¬ 
tion  of  active  proteasome.  It  is  possible  (and  remains 
to  be  verified)  that  this  chaperone-like  function  of  a  pro¬ 
peptide  is  unique  to  Pre2p,  as  it  contains  a  much  longer 
propeptide  (75  residues)  than  the  other  p  subunits. 

In  the  model  of  Figure  6,  Umplp  is  a  metabolically 


unstable  chaperone  that  is  required  for  the  proper  (and 
properly  timed)  processing  of  p  subunits  upon  the  as¬ 
sembly  of  the  20S  proteasome  and  that  is  destroyed 
within  the  newly  activated  proteasome.  The  unusual  me¬ 
chanics  of  Umplp,  its  noncatalytic  mode  of  action,  and 
its  degradation  by  the  protease  it  helps  to  activate  char¬ 
acterize  Umpi  p  as  a  novel  type  of  molecular  chaperone. 

Experimental  Procedures 
Yeast  Media 

Yeast  rich  (YPD)  and  synthetic  (S)  minimal  media  with  2%  dextrose 
(SD)  or  2%  galactose  (SG)  were  prepared  as  described  (Dohmen  et 
al..  1995). 

Isolation  of  the  UMPI  Gene 

UMPI  was  identified  using  a  selection-based  screen  for  mutants  In 
the  N-end  rule  pathway  (reviewed  by  Varshavsky,  1996).  S.  cerevis- 
iae  strain  YPH500  (Table  1)  was  transformed  with  two  plasmids: 
pJD205,  which  expressed  a  UraSp-based  N-end  rule  reporter  sub¬ 
strate  (Arg-Tpi1p-Ura3p),  and  pRL2,  which  expressed  Arg-p-gal, 
another  N-end  rule  reporter  substrate  that  can  be  monitored  using 
X-Gal  plate  assays  (Baker  and  Varshavsky,  1995).  Previous  work 
(Dohmen  et  al.,  1994)  has  shown  that  rapid  degradation  of  a  Ura3- 
based  N-end  rule  substrate  renders  the  cells  Ura~,  whereas  mutants 
in  the  N-end  rule  pathway  that  express  the  same  UraSp-based  re¬ 
porter  are  Ura^.  In  this  screen,  Ura+  isolates  were  selected  on  SD 
media  lacking  uracil  and  were  then  tested  on  X-Gai  plates  to  veri^ 
that  the  same  isolates  were  also  defective  in  the  degradation  of  the 
Arg-p-gai  N-end  rule  substrate.  One  of  the  mutants  thus  identified 
(u/np7-7)  was  cured  of  pJD205  and  transformed  with  a  genomic 
yeast  library  (Rose  et  al.,  1987).  Six  transformants  yielded  plasmids 
with  four  overlapping  inserts  that  could  restore  the  ability  of  cells 
to  degrade  Arg-p-gal.  A  1981  bp  BamHI/Sau3A  fragment  common 
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to  all  of  the  inserts  was  sequenced  and  found  to  contain  one  com¬ 
plete  ORF  and  two  flanking  incomplete  ORFs.  Further  mapping, 
using  subcloning  and  PCR,  confirmed  that  the  complete  ORF  of  445 
bp  was  responsible  for  the  complementation.  The  UMP1  sequence 
(EMBL  database  accession  number:  AJ002557)  is  identical  to  ORF 
YBR173C,  an  ORF  subsequently  identified  by  the  yeast  genome 
project. 

Construction  of  Yeast  Strains  and  Plasmids 
Table  1  lists  the  strains  used  in  this  study.  To  construct  ump1^ 
alleles,  the  1981  bp  fragment  described  above  was  subcloned  into 
M13mp19.  Using  single-stranded  DNA  of  the  resulting  phage,  a 
synthetic  oligonucleotide  and  T4  DNA  polymerase,  the  UMP1  ORF 
was  precisely  deleted  and  replaced  by  a  Bglll  restriction  site.  The 
resulting  fragment  was  subcloned  into  pUC19,  and  the  Bglll  site 
was  used  to  insert  fragments  containing  the  HIS3,  LEU2,  or  the 
URA3  gene,  the  latter  one  being  flanked  by  two  direct  repeats  of  a 
segment  of  the  £.  coli  hisG  gene  (Alani  et  al.,  1987).  The  resulting 
deletion  alleles  {ump1-M::HIS3,  ump1-A2::LEU2,  and  t/mp7- 
A3::URA3i  were  isolated  as  BamHi  fragments,  introduced  into  5. 
cerevisiae,  and  used  to  delete  the  UMP1  gene  (Rothstein,  1991). 
umplA  strains  carrying  an  unmarked  t/mp 7 -A3 allele  (resulting  from 
recombination  between  the  hisG  repeats)  were  selected  on  plates 
containing  5-fluoroorotic  acid  (Alani  et  al.,  1987).  Construction  of 
chromosomal  ORFs  that  expressed  C  terminally  tagged  versions  of 
Umpip  or  proteasome  subunits  (Prelp,  Pre2p,  Pre3p,  and  Pupip) 
instead  of  their  wt  counterparts  was  performed  as  follows.  Using 
primers  that  contained  flanking  EcoRI  and  Kpnl  sites,  3'  portions 
of  the  respective  genes  were  amplified  by  PCR.  These  sites  were 
then  used  to  insert  the  amplified  fragments  into  integrative  plasmids 
based  on  Ylplac128  {LEU2  marked)  or  Ylplac211  {URA3  marked) 
(Gietz  and  Sugino,  1988)  that  contained  sequences  encoding  epi¬ 
tope  tags  followed  by  the  terminator  sequence  of  the  CYC1  gene 
(Tcycj)-  Each  of  the  resulting  plasmids  was  linearized  within  the  cod¬ 
ing  sequence  for  targeted  Integration  into  the  S.  cere v/s/ae  genome, 
yielding  strains  with  one  copy  of  the  respective  gene  (fused  in-frame 
to  the  tag-coding  sequence)  expressed  from  its  natural  promoter 
and  JcYCh  in  addition  to  a  3'  portion  of  the  same  gene  without 
promoter.  The  epitope  tags  used  were  double  ha  ("ha")  and  Flag- 
Hiss  C'FH”)  (Dohmen  et  al.,  1995).  Plasmid  pJD429  {CEN/URA3j  ex¬ 
pressing  a  UMP1  cDNA  from  PGAL1  was  isolated  from  a  cDNA 
library  (Liu  et  al.,  1992)  through  complementation  of  the  umplA 
mutation. 

Pulse- Chase  Analyses 

R-p-gal  (Ub-R-p-gal)  and  Ub-P-p-gal  (Bachmair  et  al.,  1986)  were 
expressed  in  the  MATa  strains  JD47-13C  or  JD59.  MATa2i^-p-gal 
was  expressed  from  the  plasmid  YCp50-a2dagi-p-gal  (Hochstrasser 
and  Varshavsky,  1990)  In  the  MATa  strains  JD53  and  JD81-1 A  (Table 
1).  Pulse  labeling  for  5  min  with  Redivue  PromIx  p*S]  (Amersham) 
followed  by  a  chase  and  immunoprecipitation  with  monoclonal 
anti-p-gal  (Promega)  or  antf-ha  (16B12,  Babco)  antibodies  were 
carried  out  as  described  by  Dohmen  et  al.  (1991).  proteins  frac¬ 
tionated  by  SDS-PAGE  were  detected  by  fluorography. 

Fractionation  of  Whole-Cell  Extracts  by  Gel  Filtration 
S.  cerevisiae  were  grown  at  30®C  in  YPD  or  in  SD  media  to  ODgoo  of 
1.4  ±  0.1,  harvested  at  3000g,  washed  with  cold  water,  frozen  in 
liquid  nitrogen,  and  stored  at  ~80®C.  Cell  paste  was  ground  to 
powder  in  a  mortar  in  the  presence  of  liquid  nitrogen.  The  extraction 
buffer  was  50  mM  Tris-HCI  (pH  7.5),  2  mM  ATP,  5  mM  MgCb,  1  mM 
DTT,  1 5%  glycerol,  used  at  2  ml  per  gram  of  pelleted  yeast  cells. 
After  centrifugation  at  31,000g  for  10  min  at  2‘’C,  the  supernatant 
was  subjected  to  a  second  centrifugation  at  60.000g  for  30  min, 
yielding  an  extract  with  the  protein  concentration  of  ^^5  mg/ml.  The 
protein  concentration  of  extracts  was  equalized  in  parallel  experi¬ 
ments.  using  the  extraction  buffer.  Using  the  FPLC  system  (Phar¬ 
macia).  200  \i.\  samples  of  an  extract  were  chromatographed  on  a 
Superose-6  column  equilibrated  with  extraction  buffer.  The  flow 
rate  was  0.3  ml/min  and  fractions  of  0.6  ml  were  collected.  The 
Superose-6  column  was  calibrated  using  the  following  standards: 
thyroglobulin  (669  kDa),  ferritin  (443  kDa),  and  bovine  serum  albumin 
(66  kDa).  Dextran  blue  was  used  to  nranitor  the  void  volume. 


Assays  for  Proteolytic  Activities  with  Fluorogenic 
Peptide  Substrates 

To  determine  the  chymotrypsin-like  activity,  20  \l\  of  the  protein 
fraction  and  20  m-I  of  0.5  mM  succinyl-Leu-Leu-Val-Tyr-7-amido-4- 
methylcoumarin  In  50  mM  Tris-HCI  (pH  7.8),  2  mM  ATP,  5  mM  MgClj, 
and  1  mM  DTT  were  mixed  and  incubated  for  1  hr  at  37X.  The 
reaction  was  stopped  by  addition  of  960  pJ  of  cold  ethanol,  and 
the  fluorescence  was  measured  at  440  nm,  using  the  excitation 
wavelength  of  380  nm.  The  trypsin-like  activity  and  the  peptldylglu- 
tamyl  peptide-hydrolyzing  activity  were  determined  with,  respec¬ 
tively,  Cbz-Ala-Ala-Arg-4MeO-p-naphthylamide  and  Cbz-Leu-Leu- 
Glu-p-naphthylamide  as  fluorogenic  peptide  substrates  (Fischer  et 
al.,  1994),  except  that  the  volume  of  the  protein  fraction  was  75  p.1, 
the  reactions  were  stopped  after  270  min  at  ZTC,  and  the  fluores¬ 
cence  emission  was  measured  at  366  nm,  using  an  excitation  wave¬ 
length  of  420  nm.  One  unit  (U)  of  a  proteolytic  activity  is  defined  as 
1  (imol  of  the  fluorophore  produced  per  min  under  these  conditions. 

Immunoprecipitation,  Electrophoresis,  and  Immunoblotting 
Immunoprecipitations  were  carried  out  using  either  monoclonal  anti¬ 
tag  antibodies  (see  below),  or  polyclonal  rabbit  antisera  raised 
against  the  yeast  20S  proteasome  (a  gift  from  K.  Tanaka),  or  against 
Umpi  p-HiSe  expressed  in  E.  colL  For  immunoblotting,  the  protein 
samples  were  boiled  for  3  min  in  the  presence  of  2%  SDS  and 
0.1  M  2-mercaptoethanol,  then  subjected  to  12%  SDS-PAGE,  and 
thereafter  transferred  onto  a  PVDF  membrane  (Millipore)  in  a  wet 
blot  system  (BioRad).  The  blots  were  incubated  with  either  rabbit 
anti-ubiquitin  (Ramos  et  al.,  1995),  or  anti-Cim3p  (Ghislain  et  al., 
1993)  polyclonal  antibodies,  or  16B12  anti-ha  (BabCo),  or  M2  anti- 
Flag  (Kodak)  monoclonal  antibodies,  and  were  processed  as  de¬ 
scribed  (Ramos  et  al.,  1995),  except  that  the  initially  blotted  proteins 
were  visualized  using  horseradish  peroxidase -conjugated  goat  anti- 
mouse  or  anti-rabbit  IgGs,  the  chemiluminescence  blotting  sub¬ 
strate  detection  system  from  Boehringer  Mannheim,  and  X-ray  films. 
Nondenaturing  4.5%  acrylamide  gel  electrophoresis  was  performed 
as  described  by  Hough  et  al.  (1 987),  and  the  gels  were  incubated  for 
1 5  min  in  transfer  buffer  containing  0.1  %  SDS  before  electrotransfer. 

Affinity  Purification  of  Proteasomes  and  Related  Complexes 
The  20S  proteasome  and  its  precursors  were  purified  from  strain 
JD126  that  expressed  Prelp-FH  and  Umpip-ha.  Crude  extracts 
were  prepared  as  described  above,  followed  by  an  exchange  of 
buffer  to  50  mM  Na-phosphate  (pH  8.0),  0.3  M  NaCI,  using  PD-10 
columns  (Phannacia).  The  extract  was  then  incubated  with  Ni-NTA 
agarose  (Qiagen)  In  batch  for  2  hr  at  4X,  followed  by  washings  and 
elution  according  to  the  manufacturer's  protocol,  except  that  the 
(pH  6.0)  washing  buffer  contained  20  mM  imidazole.  His«-tagged 
proteins  were  eluted  with  a  step  gradient  of  100-500  mM  imidazole. 
Active  fractions  containing  a  mixture  of  the  mature  proteasome  and 
its  precursors  were  pooled,  diluted  2-foId  with  50  mM  Tris-HCI  (pH 
7.5),  and  Incubated  for  2  hr  at  4X  in  batch  with  0.5  ml  anti-Flag  M2 
antibody  affinity  resin  (IBl/Eastman  Kodak)  that  had  been  equili¬ 
brated  in  TN  buffer  (0.15  M  NaCI.  50  mM  Tris-HCI,  [pH  7.5D.  After 
the  loading,  the  resin  was  washed  with  TN  buffer.  Flag-tagged  pro¬ 
teins  were  specifically  eluted  with  the  Flag  epitope  peptide  (IBI/ 
Eastman  Kodak). 
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Ubiquitin-dependent  degradation  of  intracellular 
proteins  underlies  a  multitude  of  biological  processes, 
including  the  cell  cycle,  cell  differentiation,  and  re¬ 
sponses  to  stress.  One  ubiquitin-dependent  proteo¬ 
lytic  system  is  the  N-end  rule  pathway,  whose  targets 
include  proteins  that  bear  destabilizing  N-terminal 
residues.  This  pathway,  which  has  been  characterized 
only  in  somatic  cells,  is  shown  here  to  be  present  also 
in  germ  line  cells  such  as  the  eggs  of  the  amphibian 
Xenopus  Uievis.  We  demonstrate  that  the  set  of  desta¬ 
bilizing  residues  in  the  N-end  rule  pathway  ofXenopus 
eggs  is  similar,  if  not  identical,  to  that  of  somatic  cells 
such  as  Tnammalian  reticulocytes  and  fibroblasts.  It  is 
also  shown  that  the  degradation  of  engineered  N-end 
rule  substrates  in  egg  extracts  can  be  strongly  and 
selectively  inhibited  by  dipeptides  bearing  destabiliz¬ 
ing  N-terminal  residues.  This  result  allowed  us  to  ask 
whether  selective  inhibition  of  the  N-end  rule  path¬ 
way  in  egg  extracts  influences  the  apoptosis-like 
changes  that  are  observed  in  these  extracts.  A  dipep¬ 
tide  bearing  a  bulky  hydrophobic  (type  2)  destabiliz¬ 
ing  N-terminal  residue  was  found  to  delay  the  apopto- 
tic  changes  in  egg  extracts,  whereas  dipeptides  bear¬ 
ing  basic  (type  1)  destabilizing  N-terminal  residues 
had  no  effect.  High  activity  of  the  N-end  rule  pathway 
in  egg  extracts  provides  an  alternative  to  reticulocyte 
extracts  for  the  in  vitro  analyses  of  this  pathway. 

©  1998  Academic  Press 

Key  Words:  ubiqiiitin;  proteolysis;  N-end  rule;  apop¬ 
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A  number  of  regulatory  circuits,  including  those  that 
control  the  cell  cycle,  cell  differentiation,  and  responses 
to  stress,  involve  metabolically  unstable  proteins  (1- 
4).  A  short  in  vivo  half-life  of  a  regulator  provides  a  way 
to  generate  its  spatial  gradients  and  allows  for  rapid 

^  To  whom  correspondence  should  be  addressed  at  Division  of 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena, 
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adjustments  of  its  concentration,  or  subunit  composi¬ 
tion,  through  changes  in  the  rate  of  its  synthesis  or 
degradation.  Damaged  or  otherwise  abnormal  proteins 
tend  to  be  short-lived  as  well. 

Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals,  or  degrons  (5).  The  es¬ 
sential  component  of  one  degradation  signal,  called  the 
N-degron,  is  a  destabilizing  N-terminal  residue  of  a 
protein  (6,  7).  The  set  of  amino  acid  residues  which  are 
destabilizing  in  a  given  cell  3delds  a  rule,  called  the 
N-end  rule,  which  relates  the  in  vivo  half-life  of  a 
protein  to  the  identity  of  its  N-terminal  residue.  Simi¬ 
lar  but  distinct  versions  of  the  N-end  rule  pathway  are 
present  in  all  organisms  examined,  from  mammals  to 
fungi  and  bacteria  (7). 

In  eukaryotes,  an  N-degron  comprises  at  least  two 
determinants:  a  destabilizing  N-terminal  residue  and  an 
internal  lysine  or  lysines  (8-10).  The  Lys  residue  is  the 
site  of  formation  of  a  multiubiquitin  chain  (11).  The  N- 
end  rule  pathway  is  thus  a  part  of  the  ubiquitin  (Ub)^ 
system.  Ub  is  a  76-residue  eukaryotic  protein  whose  co¬ 
valent  conjugation  to  other  proteins  plays  a  role  in  a 
multitude  of  biological  processes  (1,  12—15).  In  most  of 
them,  Ub  acts  through  routes  that  involve  the  degrada¬ 
tion  of  Ub-protein  conjugates  by  the  26S  proteasome,  an 
ATP-dependent  m\iltisubunit  protease  (16-18). 

It  was  found  that  linear  Ub  fusions  are  rapidly 
cleaved  by  Ub-specific  proteases  (UBPs)  at  the  Ub- 
protein  junction,  making  possible  the  production  of 
otherwise  identical  proteins  bearing  different  N-termi- 
nal  residues.  This  technical  advance  led  to  the  discov¬ 
ery  of  the  N-end  rule  pathway  (6,  7).  The  N-end  rule  is 
organized  hierarchically.  In  the  yeast  Saccharomyces 
cerevisiae,  Asn  and  Gin  are  tertiary  destabilizing  N- 
terminal  residues  in  that  they  function  through  their 
conversion,  by  enzymatic  deamidation,  into  the  second- 

^  Abbreviations  used:  Ub,  ubiquitin;  DHFR,  mouse  dihydrofolate 
reductase;  R-transferase,  Arg-tRNA-protein  transferase;  CSF,  C3d:o- 
static  factor;  TCA,  trichloroacetic  acid;  DTT,  dithiothreitol;  AMC, 
7-amino-methylcoumarin;  UBP,  Ub-specific  processing  protease. 
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ary  destabilizing  N-terminal  residues  Asp  and  Glu. 
The  destabilizing  activity  of  N-terminal  Asp  and  Glu 
requires  their  conjugation,  by  Arg-tRNA-protein  trans¬ 
ferase  (R-transferase),  to  Arg,  one  of  the  primary  de¬ 
stabilizing  residues  (19-21).  The  primary  destabilizing 
N-terminal  residues  are  bound  directly  by  N-recognin 
(also  called  E3),  the  recognition  component  of  the  N- 
end  rule  pathway.  In  S.  cerevisiae,  N-recognin  is  a 
225-kDa  protein,  encoded  by  UBRl,  that  targets  poten¬ 
tial  N-end  rule  substrates  through  the  binding  to  their 
primary  destabilizing  N-terminal  residues  —  Phe,  Leu, 
Trp,  Tyr,  He,  Arg,  Lys,  or  His  (7,  22).  N-recognin  has  at 
least  two  substrate-binding  sites.  The  type  1  site  is 
specific  for  the  basic  N-terminal  residues  Arg,  Lys,  and 
His.  The  type  2  site  is  specific  for  the  bulky  hydropho¬ 
bic  N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  and  He  (7). 

The  currently  known  physiological  substrates  of  the 
N-end  rule  pathway  are  the  RNA  polymerase  of  Sindbis 
virus  (23),  the  GPAI -encoded  Ga  subunit  of  the  S.  cerevi¬ 
siae  heterotrimeric  G  protein  (24),  and  Cup9p,  a  short- 
hved  transcriptional  repressor  in  S.  cerevisiae  that  con¬ 
trols  the  expression  of  PTR2,  which  encodes  a  peptide 
transporter  (25).  The  Ubrlp-Cup9p-Ptr2p  circuit,  which 
controls  the  import  of  peptides  in  yeast,  is  the  first  clear 
instance  of  a  physiological  function  of  the  N-end  rule 
pathway  (25).  Among  the  cells  of  multicellular  organ¬ 
isms,  this  proteolytic  system  was  characterized  in  rabbit 
reticulocytes  (19)  and  L  cells,  a  line  of  fibroblast-like 
mouse  cells  (26).  Recently,  it  was  suggested  that  the 
N-end  rule  pathway  might  play  a  role  in  apoptosis  (pro¬ 
grammed  cell  death)  (7).  A  way  to  verify  this  conjecture 
would  be  to  use  a  mutant  that  lacks  the  N-end  rule 
pathway.  Such  mutants  were  constructed  and  character¬ 
ized  in  Escherichia  coli  and  S.  cerevisiae  (7)  but  not  yet  in 
multicellular  organisms.  Therefore  we  considered  the  use 
of  N-end  rule  inhibitors. 

Previous  work  has  shown  that  the  addition  of  amino 
acid  derivatives  such  as  dipeptides  that  bear  destabi¬ 
lizing  N-terminal  residues  to  reticulocyte  extract  re¬ 
sults  in  a  strong  and  selective  inhibition  of  the  N-end 
rule  pathway  in  the  extract  (19, 27).  Specifically,  dipep¬ 
tides  bearing  type  1  destabilizing  N-terminal  residues 
inhibited  the  degradation  of  test  N-end  rule  substrates 
bearing  basic  (type  1)  destabilizing  N-terminal  resi¬ 
dues  but  had  no  effect  on  the  degradation  of  substrates 
bearing  type  2  N-terminal  residues.  A  converse  pattern 
was  observed  with  dipeptides  bearing  bulky  hydropho¬ 
bic  (type  2)  destabilizing  N-terminal  residues  (19). 
Dipeptides  added  to  S.  cerevisiae  cultures  have  been 
demonstrated  to  inhibit  the  N-end  rule  pathway  in  vivo 
as  well  (28).  However,  experiments  with  the  N-end  rule 
pathway  of  intact  mammalian  cells  have  shown  dipep¬ 
tides  to  be  ineffective  inhibitors  in  this  setting^. 

^  F.  Levy  and  A.  Varshavsky,  unpublished  data. 


Given  this  latter  constraint,  we  sought  to  explore  the 
physiological  consequences  of  inhibiting  a  metazoan 
N-end  rule  pathway  in  the  inhibitor-accessible  setting 
of  a  cell-free  system.  We  also  wished  to  determine 
whether  the  N-end  rule  pathway  is  present  in  germ 
line  cells  such  as  the  eggs  of  the  amphibian  Xenopus 
laevis,  a  major  experimental  organism  in  studies  of 
embryogenesis  and  cell  cycle  control  (4,  29).  Our  find¬ 
ings  are  described  below. 

MATERIALS  AND  METHODS 

Construction  of  plasmids.  The  plasmids  pT7-UbMe*^DHFRhis 
and  pT7-UbRe^DHFRhis  expressed  Ub-Met-e^^-DHFR-Hisg  and  Ub- 
Arg-e^-DHFR-Hisg  fusions  (denoted,  respectively,  as  Ub-Met-DHFR 
and  Ub-Arg-DHFR)  in  E.  coli  from  the  T7  polymerase  promoter  (see 
Results  for  the  definitions  of  e*^  and  other  terms).  These  plasmids 
were  constructed  from  pEJJl-M  and  pEJJl-R  (30).  The  JYpnl-Rmd- 
III  fragment  of  pEJJl-M  and  pEJJl-R  was  replaced  with  a  synthetic 
oligonucleotide  duplex  (5'-CCATCACCATCACCATCACJTAAA-3'  and 
5^-AGCTTTTAGTGATGlGTGATGGTGATGGGTAC-3')  that  encoded 
the  Hisg  tag  and  bore  the  Kpnt  and  Hindlll  overhangs.  The  other 
pT7-UbXe^DHFRhis  plasmids,  which  encoded  otherwise  identi¬ 
cal  Ub-X-DHFR  fusions  bearing  different  X  residues  (Met,  Arg, 
Leu,  Phe,  Cys,  Asp,  or  Asn)  (Fig.  1),  were  constructed  from  pT7- 
UbRe^^DHFRhis  by  site-directed  mutagenesis,  using  PCR  (31). 

Overexpression,  labeling,  and  purification  of  Uh-X-e^-DHFR-HiSg 
proteins.  A  pT7-UbXe*^DHFRhis  plasmid  was  introduced  into  E. 
coli  BL21  (DE3)  (31).  Protein  expression  was  induced  in  E.  coli  by 
adding  isopropyl- l-thio-/3-D-galactopyranoside  (IPTG),  and  cells 
were  labeled  with  P^Slmethionine/cysteine  (^^S-Express,  New  En¬ 
gland  Nuclear,  Boston,  MA),  as  described  (30).  The  labeled  cells  were 
collected  by  centrifugation  and  disrupted  by  sonication,  and  a  ^^S- 
labeled  Ub-X-e^-DEFR-Hisg  (Ub-X-DHFR)  test  protein  was  purified 
by  affinity  chromatography  under  nondenaturing  conditions,  using 
the  Ni-NTA  Spin  Kit  (Qiagen,  Chatsworth,  CA),  and  dialyzed  against 
1  mM  MgCl2,  1  mM  dithiothreitol  (DTT),  0,1  M  Tris-HCl  (pH  7.7), 
frozen  rapidly,  and  stored  at  —  80®C  in  samples  that  were  to  be 
thawed  only  once.  The  specific  radioactivity  of  P®S]Ub-X-DHFR  pro¬ 
teins  was  ^10  X  10^  cpm/p.g. 

Degradation  assays  in  Xenopus  egg  extracts.  Cytostatic  factor 
(CSF)-arrested  egg  extracts  were  prepared  from  Xenopus  eggs  as  de¬ 
scribed  (32).  In  most  experiments,  cycloheximide  (0.1  mg/ml)  was  added 
to  the  extracts  to  preclude  remcorporation  of  E^Slmethionine  into  newly 
made  proteins.  In  some  experiments  (see  the  legends  to  figures),  egg 
extracts  were  activated  by  the  addition  of  0.4  mM  CaCl2  1  h  before  the 
assay.  P^S]Ub-X-DHFR  test  proteins  were  added  to  the  extract  to  the 
final  concentration  of  —25  yug/vd.  Dipeptides  were  added  together  with 
bestatin  (Sigma,  St.  Louis,  MO)  to  the  final  concentrations  of  10  mM 
and  50  yg/mt,  respectively.  Bestatin  was  added  to  decrease  the  degra¬ 
dation  of  dipeptides  in  the  extract  (27).  (Control  experiments  (not 
shown)  showed  that  the  addition  of  bestatin  alone  did  not  significantly 
inhibit  the  degradation  of  test  proteins  by  the  N-end  rule  pathway.)  The 
following  dipeptides  and  other  amino  acid  derivatives  were  used:  Arg- 
/3-Ala,  Ala-Lys,  Lys-Ala,  Trp-Ala  (Sigma),  and  Tyr-His  (Bachem  Sci¬ 
ence,  King  of  Prussia,  PA).  Stock  samples  of  dipeptides  were  0.5  M 
solutions  in  10  mM  K-Hepes,  pH  7.5. 

To  follow  the  degradation  of  test  proteins,  reaction  mixtures  (0.1 
ml)  prepared  as  described  above  were  incubated  at  23®C.  Samples 
(2.5  (jlI)  were  withdrawn  in  duplicate  for  each  time  point.  One  sample 
was  examined  by  SDS-12%  PAGE  and  autoradiography,  using  Phos- 
phorlmager  (Molecular  Dynamics,  Sunnyvale,  CA)  (33).  The  other 
sample  was  used  to  determine  the  relative  amount  (%)  of  ^^S  soluble 
in  5%  trichloroacetic  acid  (TCA)  (30).  This  parameter  was  calculated 
as  follows: 
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FIG.  1.  The  Ub-X-DHFR  test  proteins.  These  fusions,  identical  except  for  the  variable  residue  X  adjacent  to  the  Ub  moiety,  were  expressed 
and  radiolabeled  in  E.  coli,  purified,  and  used  as  described  under  Materials  and  Methods.  X  was  either  Met,  Arg,  Leu,  Phe,  Cys,  Asp,  or  Asn. 
The  E.  coli  Lac  repressor-derived  42-residue  sequence,  denoted  as  e^  (extension  (e)  containing  lysines  (K))  was  described  previously  (8,  30, 
37).  The  alternative  ubiquitylation  sites  Lys-15  and  Lys-17  are  boxed  in  the  sequence  of  e*^  (8).  DHFR  represents  DHFR-Hisg,  the  mouse 
DHFR  whose  C-terminus  was  extended  by  8  residues  that  included  the  Hisg  tag. 


X  a 

percent  TCA-soluble  =  y  •  ^  *  100%, 

where  X  is  the  amount  of  TCA-soluble  (cpm);  Y  is  the  total 
amount  of  (cpm)  in  the  same  sample;  a  is  the  number  of  Met 
residues  in  a  Ub-X-DHFR  test  protein  (a  =  10  for  Ub-Met-DHFR;  a  = 
9  for  the  other  Ub-X-DHFRs).  The  a  - 1  term  above  corrects  for  the 
presence  of  one  Met  residue  in  Ub. 

The  data  presented  in  each  of  the  experimental  Figures  were 
produced  using  the  same  extract  on  the  same  day.  Unless  stated 
otherwise,  the  data  were  consistent  between  at  least  two  (usually 
three  or  more)  independent  experiments  carried  out  with  different 
preparations  of  the  extract. 

In  vitro  apoptosis  assays.  Apoptotic  extracts  from  Xenopus  eggs 
were  prepared  according  to  Newmeyer  et  al.  (34).  The  difference 
between  control  and  apoptotic  egg  extracts  was  in  the  time  intervals 
between  hormone  injections  used  to  produce  the  eggs.  Frogs  were 
injected  with  75  units  of  pregnant  mare  serum  gonadotropin  (Cal- 
biochem,  San  Diego,  CA)  14-30  days  (in  contrast  to  3-10  days  for 
control  extracts)  before  they  were  induced  to  lay  eggs  by  injecting  800 
units  of  human  chorionic  gonadotropin  (Sigma).  Eggs  were  collected, 
and  the  extract  was  prepared  as  described  (32).  The  extracts  were 
used  within  1  h  after  preparation.  For  most  assays,  the  extracts  were 
arrested  in  interphase  by  the  addition  of  0.4  mM  CaCl2  and  0.1 
mg/ml  cycloheximide.  DemembranatedXenopas  sperm  nuclei  (32),  at 
'-1000  per  julI  of  extract,  were  also  added  at  this  time.  Extracts 
treated  as  described  were  distributed  into  0.1-ml  samples  and  incu¬ 
bated  at  23®C.  Apoptotic  changes  in  the  extracts  were  monitored  in 
two  ways:  by  observing  shrinkage  sind  disintegration  of  the  added 
sperm  nuclei  using  phase  contrast  and  fluorescent  microscopy  (34) 
and  by  measuring  the  DEVD-specific  protease  activity.  The  latter 
assay  determined  the  total  activity  of  caspases  that  recognize  the 
tetrapeptide  sequence  DEVD  (Asp-Glu-Val-Asp)  (35,  36). 

The  DEVD-specific  protease  activity  was  measured  by  incubating 
5  fx\  of  extract  with  a  quenched  fluorescent  substrate  Ac-DEVD-AMC 
(50  fiM)  (Bachem  Bioscience)  in  0.1  ml  of  the  extract  buffer  (15  mM 
MgCl2,  1  mM  DTT,  20  mM  EGTA,  80  mM  Na-)S-glycerolphosphate, 
pH  7  .3).  The  reaction  was  carried  out  for  10  min  at  23°C,  followed  by 
freezing  of  a  20-/nl  sample  in  liquid  Ng.  Samples  were  thawed  by 
diluting  into  1  ml  of  phosphate-buffered  saline,  and  their  fluores¬ 
cence  (excitation  at  380  nm  and  emission  at  460  nm)  was  measured 
in  the  Hitachi  F-4500  spectrofluorimeter  (35,  36).  AMC  was  used  as 
a  fluorescence  standard. 

RESULTS 

The  N-End  Rule  Pathway  in  Xenopus  Egg  Extracts 

Among  the  cells  of  midticellular  organisms,  the  N-end 
rule  pathway  has  been  identified  and  analyzed  in  rabbit 


reticuloc3d:es  (19)  and  L  cells,  a  line  of  transformed,  fibro- 
blast-like  mouse  cells  (26).  To  determine  whether  the 
N-end  rule  pathway  is  present  in  germ  line  cells  such  as 
Xenopus  eggs,  we  used  an  approach  similar  to  the  earlier 
one  with  rabbit  reticulocytes  (19,  30). 

A  set  of  N-end  rule  substrates  differing  exclusively 
by  their  N-terminal  residues  was  constructed,  ex¬ 
pressed  in  E.  coli,  metabolically  labeled  with  P^Sjme- 
thionine,  and  purified  by  affinity  chromatography  (see 
Materials  and  Methods).  A  test  protein  of  this  set, 
Ub-X-e^-DHFR-HiSg,  denoted  below  as  Ub-X-DHFR, 
contained  the  following  parts:  the  N-terminal  Ub  moi¬ 
ety;  a  variable  residue  X;  e^,  a  42-residue,  E.  coli  Lac 
repressor-derived  sequence  that  contained  the  second 
(lysine)  determinant  of  the  N-degron  (8,  37);  and  the 
Hisg-tagged  mouse  dihydrofolate  reductase  moiety 
(Fig.  1),  Ub  fusions  are  not  cleaved  at  the  Ub-protein 
junction  in  E.  coli,  which  lacks  the  Ub  system  (38).  By 
contrast,  in  eukaryotes  the  Ub  fusions,  including  the 
Ub-X-DHFRs,  are  rapidly  cleaved  by  UBPs  after  the 
last  residue  of  Ub,  making  it  possible  to  produce,  in 
vivo  or  in  vitro,  otherwise  identical  test  proteins  such 
as  X-DHFRs  that  differ  exclusively  by  their  N-terminal 
residues  (6,  7). 

CSF-arrested  Xenopus  egg  extracts  were  prepared 
from  imfertilized  frog  eggs  (32).  Specific  ^^S-labeled 
Ub-X-DHFR  test  proteins  (Fig.  1)  were  added  to  the 
egg  extracts,  and  their  metabolic  fates  were  monitored 
either  by  SDS-PAGE  and  autoradiography  or  by  mea¬ 
suring  the  amount  of  TCA-soluble  released  during 
incubation  at  23°C.  The  identity  of  residue  X  in  the 
tested  X-DHFRs  encompassed  at  least  one  representa¬ 
tive  of  each  class  of  N-terminal  destabilizing  residues: 
Asn  (tertiary  destabilizing);  Asp  and  Cys  (secondary 
destabilizing),  Arg  {type  1  primary  destabilizing),  Phe 
and  Leu  (type  2  primary  destabilizing),  and  Met  (sta¬ 
bilizing)  (see  introduction)  (7). 

As  expected,  a  Ub-X-DHFR  test  protein  was  deubiq- 
uitylated  upon  its  addition  to  the  extract.  This  was 
detected  through  an  increase  in  the  electrophoretic 
mobility  of  the  major  ^®S-labeled  species  and  the  con- 
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FIG.  2.  Deubiquitylation  and  degradation  of  Ub-X-DHFR  test  proteins  in  Xenopus  egg  extract.  (A)  ^^S-labeled,  purified  Ub-Arg-DHFR  was 
added  to  a  CSF-arrested  Xenopus  egg  extract  and  incubated  at  23°C  for  the  indicated  times.  The  samples  were  analyzed  by  SDS-PAGE,  and 
the  labeled  species  were  detected  by  autoradiography  (see  Materials  and  Methods).  (B)  Same  as  A,  but  with  Ub-Phe-DHFR.  (C)  Same  as  A, 
but  with  Ub-Met-DHFR  (only  the  area  around  Met-DHFR  is  shown).  (D)  Same  as  C,  but  with  Ub-Asp-DHFR.  (E)  Same  as  C,  but  with 
Ub-Asn-DHFR.  Open  arrowheads  indicate  the  bands  of  the  initial,  34-kDa  Ub-X-DHFR.  Closed  arrowheads  indicate  the  bands  of  deubiq- 
uitylated,  26-kDa  X-DHFR.  An  asterisk  indicates  the  band  of  free  [^^S]Ub,  which  migrated  close  to  the  dye  front  in  this  electrophoretic 
system.  The  molecular  masses  (in  kDa)  of  protein  markers  are  indicated  to  the  right  of  B. 


comitant  appearance  of  a  labeled  ^8  kDa  species  (Fig. 
2  and  data  not  shown).  Deubiquitylation  of  Ub-X- 
DHFR  in  the  extract  was  essentially  complete  in  5  min 
or  less  (Fig.  2).  Whereas  the  relative  intensity  of  the 
free  Ub  band  remained  nearly  constant  during  the 
subsequent  4-h  incubation  (Figs.  2 A  and  2B,  and  data 
not  shown),  the  relative  intensity  of  the  X-DHFR  band 
was  either  constant  or  decreased,  at  a  varying  rate, 
depending  on  the  identity  of  the  N-terminal  residue  X. 
Based  on  their  relative  metabolic  stabilities  in  the  ex¬ 
tract  (Figs.  2  and  3A,  and  data  not  shown),  the  X- 
DHFR  test  proteins  could  be  reproducibly  ranked  from 
the  long-lived  to  the  short-lived  as  follows:  Met  > 
Cys  >  Leu  >  Asp  >  Asn  >  Phe  >  Arg.  We  conclude 
that  Xenopus  egg  extracts  contain  an  N-end  rule  path¬ 
way  whose  rule  book  is  similar,  and  may  be  identical, 
to  those  found  in  rabbit  reticulocytes  (19)  and  mouse  L 
cells  (26). 

The  decay  curves  of  X-DHFRs  that  were  determined 
by  monitoring  the  release  of  TCA-soluble  and  the 
resulting  ranking  of  their  N-terminal  destabilizing  res¬ 
idues  were  reproducible  between  experiments  that  uti¬ 
lized  independent  preparations  of  the  egg  extract  (data 
not  shown).  Specifically,  it  took  —10  min  to  destroy 
50%  of  Arg-DHFR  to  acid-soluble  fragments.  By  30 
min,  the  degradation  of  Arg-DHFR  was  —90%  com¬ 
plete  (Figs.  2 A  and  3A). 

Approximately  20%  of  the  added  Met-DHFR,  which 
bore  a  stabilizing  N-terminal  residue  (7),  was  degraded 
during  the  first  10  min  of  incubation,  and  another  5  to 


15%  was  degraded  over  the  next  30  min.  However,  the 
rest  of  the  Met-DHFR  remained  stable  in  the  extract 
(Figs.  2C  and  3A).  A  likely  explanation  of  this  result  is 
that  our  preparation  contained  two  types  of  Met-DHFR 
molecules:  —70%  were  undamaged  and  long-lived  in 
the  extract,  as  observed  with  Met-DHFR  in  vivo,  in 
both  S.  cerevisiae  and  mouse  L  cells  (Ref.  8;  F.  Levy  and 
A.  V.,  unpubl.  data),  whereas  the  remainder  comprised 
misfolded  or  otherwise  damaged  Met-DHFR  molecules 
that  were  degraded  by  a  proteol3i:ic  system  distinct 
from  the  N-end  rule  pathway.  By  inference,  the  same 
background  of  degradation  of  a  subpopulation  of  mol¬ 
ecules  would  be  expected  for  other  X-DHFRs.  It  is 
unclear  whether  a  subpopulation  of  misfolded  or  oth¬ 
erwise  damaged  X-DHFRs  resulted  from  their  overex¬ 
pression  in  E,  coli,  or  whether  the  damage  occurred 
largely  during  purification  of  X-DHFRs.  Independent 
evidence  for  this  explanation  of  the  background  degra¬ 
dation  of  Met-DHFR  is  presented  below. 

Previous  work  has  shown  that  N-terminal  Cys  is  a 
stabilizing  residue  in  S.  cerevisiae  (8),  but  is  a  second¬ 
ary  destabilizing  residue  in  rabbit  reticuloc5d:e  extracts 
and  mouse  L  cells  (19,  26).  In  Xenopus  egg  extracts, 
Cys  was  found  to  be  a  weakly  destabilizing  residue. 
Specifically,  among  the  tested  X-DHFRs,  Cys-DHFR 
was  the  longest-lived  test  protein  save  for  Met-DHFR, 
which  bore  a  stabilizing  N-terminal  residue  (Fig.  3A). 
Substrates  that  bear  secondary  or  tertiary  destabihz- 
ing  residues  require  the  tRNA-dependent  activity  of 
R-transferase  for  their  degradation  by  the  N-end  rule 


THE  N-END  RULE  IN  XENOPUS 


321 


FIG.  3.  Degradation  of  X-DHFR  test  proteins  in  egg  extracts  under  different  conditions,  as  determined  by  measuring  TCA-soluble  (A) 
3^S-labeled,  purified  Ub-Arg-DHFR  (□),  Ub-Phe-DHFR  (O),  Ub-Met-DHFR  (A),  Ub-Asp-DHFR  (O),  and  Ub-Asn-DHFR  (ffi)  were  added  to 
samples  of  the  CSF-arrested  Xenopus  egg  extract.  After  incubation  at  23®C  for  the  indicated  times,  the  amounts  of  soluble  in  5%  TCA  were 
determined  for  each  time  point  (see  Materials  and  Methods).  Since  the  free  P^S]Ub,  produced  through  deubiquitylation  of  a  Ub-X-DHFR,  was 
stable  during  the  incubation,  100%  of  TCA-soluble  represented  complete  degradation  of  the  Ub-lacking  P^S]X-DHFR  (see  Materials  and 
Methods  for  further  details).  (B)  Pretreatment  of  egg  extract  with  RNAase  selectively  inhibits  degradation  of  N-end  rule  substrates  bearing 
secondary  destabilizing  N-terminal  residues.  A  CSF-arrested  extract  was  pretreated  with  5  units/ml  of  RNAase  A-agarose  (Sigma)  for  1  h  at 
23°C,  followed  by  the  removal  of  RNAase-agarose  by  centrifugation  {closed  symbol  curves).  Another  sample  of  the  same  extract  was  incubated 
for  1  h  without  RNAase  {open  symbol  curves).  ^®S-labeled  Ub-Cys-DHFR  {triangles),  Ub-Asp-DHFR  {circles),  or  Ub-Arg-DI^R  {squares)  were 
added  to  both  extracts,  and  the  relative  amounts  of  TCA-soluble  ^^S  released  during  the  incubation  were  determined  as  in  A.  (C)  Different 
states  of  egg  extract  do  not  significantly  alter  the  activity  of  the  N-end  rule  pathway.  ^^S-labeled  Ub-Phe-DHFR  was  added  to  the 
CSF-arrested  egg  extract  (□),  to  the  CaClg-treated  extract  that  proceeded  toward  mitosis  (A),  and  to  the  CaClg-  and  cycloheximide-treated, 
interphase-arrested  extract  (O).  The  degradation  of  Phe-DHFR  was  monitored  as  described  in  A. 


pathway  (see  introduction).  Previous  work  has  shown 
that  pretreatment  of  reticulocyte  extract  with  RNAase 
A  does  not  significantly  affect  the  degradation  of  N-end 
rule  substrates  bearing  primary  destabilizing  residues 
but  abolishes  the  degradation  of  otherwise  identical 
substrates  bearing  secondary  or  tertiary  destabilizing 
N-terminal  residues  (19).  In  agreement  with  these 
findings,  preincubation  of  Xenopus  egg  extract  with 
RNAase  A  selectively  inhibited  the  degradation  of  Cys- 
DHFR  and  Asp-DHFR,  but  had  no  effect  on  the  degra¬ 
dation  of  Arg-DHFR  (Fig.  3B).  We  conclude  that  both 
Cys  and  Asp  are  secondary  destabilizing  residues  in 
the  N-end  rule  pathway  of  Xenopus  eggs.  Taken  to¬ 
gether  with  the  earher  findings  about  the  N-end  rule 
pathway  in  reticulocyte  extract  (19),  the  present  find¬ 
ings  are  consistent  with  the  conjecture  that  the  set  of 
secondary  and  tertiary  destabilizing  residues  in  the 
N-end  rule  Xenopus  eggs  is  identical  to  that  of  rabbit 
reticuloc3t:es:  two  tertiary  destabilizing  residues  (Asn 
and  Gin)  and  three  secondary  ones  (Asp,  Glu,  and  Cys). 

We  also  asked  whether  the  rate  of  degradation  of  an 
N-end  rule  substrate  depends  on  physiologically  rele¬ 
vant  changes  in  the  state  of  an  egg  extract.  Unfertil¬ 
ized  Xenopus  eggs  are  arrested  at  the  metaphase  of 
meiosis  II  through  the  action  of  cytostatic  factor  (39, 
40).  Fertilization  results  in  a  transient  increase  of  the 
Ca^"^  concentration  in  the  eggs  that  inactivates  CSF, 
allowing  cells  to  enter  interphase  and  proceed  to  mito¬ 
sis.  This  process  can  be  mimicked  in  vitro  by  the  addi¬ 
tion  of  Ca^^^  (32).  Such  extracts,  referred  to  as  cycling 
extracts,  undergo  changes  characteristic  of  eggs  at  mi¬ 


tosis,  as  observed  through  alterations  in  the  morphol¬ 
ogy  of  Xenopus  sperm  nuclei  added  to  the  extract.  The 
addition  of  cycloheximide  and  CaCl2  to  a  CSF-arrested 
extract  results  not  only  in  the  escape  from  CSF  arrest 
but  also  in  a  subsequent  arrest  at  interphase,  owing  to 
the  inhibition  of  protein  synthesis,  specifically  the  syn¬ 
thesis  of  cyclins  (32).  We  examined  the  degradation  of 
Phe-DHFR,  bearing  a  type  2  primary  destabilizing  N- 
terminal  residue,  in  the  CSF-arrested,  interphase-ar¬ 
rested,  and  cycling  egg  extracts,  and  found  no  signifi¬ 
cant  differences  in  the  rate  of  degradation  of  this  N-end 
rule  substrate  in  different  extracts  (Fig.  3C). 

Dipeptides  Bearing  Destabilizing  N-Terminal 

Residues  Are  Efficacious  Inhibitors  of  the  N-End 

Rule  Pathway  in  Xenopus  Egg  Extracts 

Previous  work  has  shown  that  the  addition  of  amino 
acid  derivatives  such  as  dipeptides  bearing  destabiliz¬ 
ing  N-terminal  residues  to  reticulocyte  extracts  or  in¬ 
tact  yeast  cells  results  in  a  strong  and  selective  inhibi¬ 
tion  of  the  N-end  rule  pathway  in  these  settings  (19, 
27).  However,  experiments  with  intact  mammalian 
cells  in  culture  have  shown  dipeptides  to  be  at  most 
weak  inhibitors  of  the  N-end  rule  pathway  in  this 
setting^.  The  present  work  stemmed  in  part  from  the 
possibility  of  exploring  the  physiological  consequences 
of  perturbing  a  metazoan  N-end  rule  pathway  in  the 
inhibitor-accessible  setting  of  a  cell-free  system  such  as 
a  Xenopus  egg  extract. 
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FIG.  4.  Inhibition  of  the  N-end  rule  pathway  in  egg  extracts  by 
dipeptides  bearing  destabilizing  N-terminal  residues..  (A)  ^^S-la- 
beled,  purified  Ub-Arg-DHFR  was  added  to  CSF-arrested  Xenopus 
egg  extracts  in  the  absence  (□)  or  the  presence  of  the  following 
dipeptides  (10  mM),  together  with  bestatin  (50  /ig/ml):  Arg-jS-Ala  (O), 
Lys-Ala  (O),  Tyr-His  (A),  Trp-Ala  (ffi),  and  Ala-Lys  (X).  After  incu¬ 
bation  at  23®C  for  the  indicated  times,  the  amounts  of  soluble  in 
5%  TCA  were  determined  for  each  time  point.  (B)  Same  as  in  A  but 
with  Ub-Phe-DHFR. 


As  shown  in  Figs.  4  and  5,  Lys-Ala  and  especially 
Arg-jS-Ala,  both  of  which  bear  a  type  1  primary  desta¬ 
bilizing  N-terminal  residue,  strongly  inhibited  the  deg¬ 
radation  of  Arg-DHFR  in  the  egg  extract.  This  inhibi¬ 
tion  was  selective,  in  that  the  same  dipeptides  had  no 
significant  effect  on  the  degradation  of  Phe-DHFR, 
which  bore  a  type  2  primary  destabilizing  N-terminal 
residue.  Conversely,  Tyr-His  and  Trp-Ala,  which  bear 
type  2  destabilizing  N-terminal  residues,  strongly  in¬ 
hibited  the  degradation  of  Phe-DHFR,  but  had  at  most 
a  small  effect  on  the  degradation  of  Arg-DHFR  (Figs.  4 
and  5).  Ala-Lys,  which  bears  a  type  3  destabilizing 
N-terminal  residue  (19),  did  not  inhibit  the  degrada¬ 
tion  of  either  Arg-  or  Phe-DHFRs  (Fig.  4). 

The  dipeptides  had  little  effect  on  the  degradation  of 
their  cognate  N-end  rule  substrates  during  the  first  15 
min  of  the  incubation,  but  subsequent  proteolysis  was 
strongly  inhibited  (Fig.  4).  This  result  is  consistent 
with  the  finding  that  20-30%  of  Met-DHFR,  which 
bears  a  stabilizing  N-terminal  residue,  was  rapidly 
degraded,  whereas  the  rest  of  the  added  Met-DHFR 
was  stable  (Fig.  3A).  As  described  above,  the  latter 
finding  could  be  accounted  for  if  our  preparation  of 
Met-DHFR  (and,  by  inference,  of  other  X-DHFRs)  com¬ 
prised  two  distinct  populations  of  molecules:  undam¬ 
aged  ones  ('-^70%)  and  hence,  in  the  case  of  Met-DHFR, 
long-lived  in  the  extract,  and  misfolded  or  otherwise 
damaged  molecules  that  were  recognized  and  degraded 
by  a  proteol3d;ic  system  distinct  from  the  N-end  rule 
pathway.  The  failure  of  dipeptides  to  inhibit  the  initial 
burst  of  degradation  of  their  cognate  X-DHFRs  (Fig.  4) 
is  in  agreement  with  this  explanation. 


Apoptosis  and  the  N-End  Rule  Pathway 

Several  groups  have  described  the  use  of  Xenopus 
egg  extracts  to  analyze  the  process  of  apoptosis  (34,  36, 
41-43).  After  several  hours  of  incubation  of  the  inter- 
phase-arrested,  apoptotic  egg  extracts,  the  activity  of 
DEVD-specific  caspases  (a  subgroup  in  the  ICE/CED-3 
family  of  cysteine  proteases)  (43)  rises  sharply,  fol¬ 
lowed  by  fragmentation  of  the  added  sperm  nuclei  (34). 
Both  the  induction  of  caspases  and  the  fragmentation 
of  the  added  nuclei  in  the  egg  extract  can  be  suppressed 
by  the  addition  of  Bcl2,  a  protein  that  inhibits  apopto¬ 
sis  in  vivo  (34,  36).  Recent  work  has  shown  that  apo¬ 
ptotic  changes  in  egg  extract  are  triggered,  through 
unknown  cytosolic  factors,  by  the  release  of  C3rtochrome 
c  from  the  mitochondrial  fraction  of  the  extract  (43). 
Apparently  Bcl2  inhibits  apoptosis  by  blocking  the  re¬ 
lease  of  cytochrome  c  from  mitochondria  (43,  44). 

We  wished  to  determine  whether  the  apoptosis-like 
events  in  egg  extracts  would  be  perturbed  by  dipeptide 
inhibitors  of  the  N-end  rule  pathway.  In  preliminary 
experiments,  we  found  that  the  times  of  apoptotic 
changes  in  egg  extracts  varied  considerably  between 
independent  extract  preparations.  Specifically,  the 
sharp  rise  of  the  DEVD-specific  protease  activity  and 
the  disintegration  of  the  added  sperm  nuclei  coiild  oc¬ 
cur  as  early  as  2  h  after  the  start  of  incubation  or  as 


A  Minutes 

^  <b  ^ 


no  Inhibitors 


B  Minutes 
^  ^ 


no  inhibitors 


Arg-p-Ala 


Arg-B-Ala 


Trp-Ala  Trp-Ala 

FIG.  5.  Electrophoretic  analysis  of  N-end  rule  substrates  in  egg 
extracts  in  the  presence  of  dipeptides.  (A)  ^®S-labeled,  purified  Ub- 
Arg-DHFR  was  added  to  a  CSF-arrested  Xenopus  egg  extract  in  the 
absence  or  the  presence  of  dipeptides  Arg-j3-Ala  or  Trp-Ala  (at  10 
mM,  together  with  bestatin  at  50  pg/ml).  After  incubation  at  23®C  for 
the  indicated  times,  the  samples  were  analyzed  by  SDS~PAGE,  and 
the  labeled  proteins  were  detected  by  autoradiography.  Open  arrow¬ 
heads  indicate  the  bands  of  the  initial,  34-kDa  Ub-Arg-DHFR. 
Closed  arrowheads  indicate  the  bands  of  deubiquitylated,  26-kDa 
Arg-DHFR.  (B)  Same  as  in  A  but  with  Ub-Phe-DHFR. 
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FIG.  6.  Apoptotic  changes  in  Xenopus  egg  extracts,  assayed  by 
measuring  the  DEVD-specific  protease  activity.  (A)  A  preparation  of 
the  CSF-arrested  egg  extract  was  divided  into  three  samples.  The 
extracts  were  left  imtreated  (□)  or  were  activated  by  either  0.4  mM 
CaCl2  (O)  or  0.4  mM  CaCl2  in  the  presence  of  0.1  mg/ml  cyclohexi- 
mide  (O).  The  samples  were  incubated  at  room  temperature  for  up  to 
5  h.  The  DEVD-specific  protease  activity  in  the  extracts  was  deter¬ 
mined  using  the  fiuorogenic  substrate  Ac-DEVD-AMC.  (B)  Samples 
of  interphase-arrested  egg  extract  were  incubated  either  in  the  ab¬ 
sence  (O)  or  in  the  presence  of  the  following  reagents:  Lys-Ala  at  20 
mM  (B);  Ala-Lys  at  20  mM  «►);  Trp-Ala  at  20  mM  (X);  Trp-Ala  plus 
Lys-Ala,  each  at  10  mM  (O);  Ac-DEVD-CHO  (an  inhibitor  of  DEVD 
proteases,  at  10  mM)  (A).  The  dipeptides  were  added  together  with 
bestatin  (50  pg/ml).  The  DEVD-specific  protease  activity  was  mea¬ 
sured  as  in  A. 


late  as  5  h  (data  not  shown).  This  variability,  reported 
by  others  as  well  (43),  is  likely  to  result  in  part  from  the 
variabihty  of  the  timing  of  upstream  events  that  trig¬ 
ger  the  beginning  of  apoptosis  in  the  Xenopus  egg  ex¬ 
tract.  In  addition,  the  timing  of  apoptotic  changes  de¬ 
pended  on  the  state  of  the  extracts;  when  a  single 
preparation  was  split  into  three  samples,  and  was  used 
as  CSF-arrested,  cycling,  and  interphase-arrested  ex¬ 
tracts,  the  increase  of  the  DEVD-specific  protease  ac¬ 
tivity  was  observed  first  in  the  interphase-arrested 
extract,  then  in  the  CSF-arrested  one,  and  later  in  the 
cycling  extract  (Fig.  6A).  This  pattern  was  reproducible 
between  different  preparations  of  the  extract,  even 
though  the  absolute  timing  of  the  rise  of  the  DEVD- 
specific  protease  activity  varied  significantly  between 
preparations  (data  not  shown). 

We  asked  whether  selective  inhibition  of  the  N-end 
rule  pathway  would  affect  the  nature  or  kinetics  of  the 
apoptotic  changes.  Dipeptides  that  had  been  shown  to 
be  efficacious  inhibitors  of  the  N-end  rule  pathway 
(Figs.  2  and  3A)  were  added  to  the  interphase-arrested 
and  cycling  extracts  at  the  beginning  of  incubation.  We 
found  that  Lys-Ala  and  Arg-/3-Ala,  containing  type  1 
primary  destabilizing  N-terminal  residues,  and  Ala- 
Lys,  containing  a  t3T)e  3  destabilizing  residue,  did  not 
alter  the  timing  of  DEVD-specific  protease  induction 
(Fig.  6B).  In  contrast,  Trp-Ala,  which  bears  a  t3T)e  2 
destabilizing  N-terminal  residue,  delayed  both  the  rise 
of  the  DEVD-specific  protease  activity  and  the  disinte¬ 
gration  of  sperm  nuclei  by  ~1  h  in  8  out  of  12  indepen¬ 


dent  experiments  (Fig.  6B  and  data  not  shown).  The 
~1  h  delay  of  apoptotic  changes  by  Trp-Ala  was  signif¬ 
icant  but  weak  in  comparison  to  the  effect  of  Ac-DEVD- 
CHO,  an  inhibitor  of  DEVD-specific  proteases,  which 
delayed  apoptosis  by  at  least  2  h  (data  not  shown). 
Interestingly,  the  proteasome  inhibitor  5-iodo-4-hy- 
droxyl-3-nitrophenylacetyl-leucinyl-leucinyl-leucine- 
vinyl  sulfone  (NIP-L3VS)  (45),  which  inhibited  the  deg¬ 
radation  of  Phe-DHFR  in  egg  extracts,  also  delayed 
apoptosis  in  the  CSF-arrested  extract  by  ~1  h  (data 
not  shown),  similarly  to  the  effect  of  Trp-Ala  (Fig.  6B). 

DISCUSSION 

We  report  the  following  results: 

(1)  A  whole-cell  extract  ofX  laevis  eggs  contains  the 
N-end  rule  pathway,  as  determined  by  monitoring  the 
metabolic  fates  of  purified,  ^®S-labeled  Ub-X-DHFR 
test  proteins  added  to  the  extract.  Among  the  cells  of 
multicellular  organisms,  the  N-end  rule  pathway  has 
been  identified  and  characterized  in  rabbit  reticulo¬ 
cytes  (18)  and  L  cells,  a  mouse  cell  line  (19,  26),  but  not, 
until  now,  in  germ  line  cells. 

(2)  The  partial  N-end  rule  defined  in  egg  extracts 
was  found  to  be  identical  to  the  corresponding  subsets 
of  the  N-end  rules  in  rabbit  reticulocytes  and  mouse 
fibroblasts.  In  particular,  we  showed  that  in  egg  ex¬ 
tracts  the  destabilizing  activity  of  N-terminal  Asn, 
Asp,  and  Cys  requires  the  presence  of  RNA  (presum¬ 
ably  tRNA),  strongly  suggesting  that  these  are  second¬ 
ary  (Asp  and  Cys)  and  tertiary  (Asn)  residues  in  the 
egg’s  N-end  rule  pathway,  as  they  have  been  shown  to 
be  in  other  metazoan  cells  (see  introduction  for  the 
terminology)  (7). 

(3)  The  activity  of  the  N-end  rule  pathway  did  not 
change  significantly  upon  shifts  of  the  CSF-arrested  egg 
extract  to  the  cycling  or  the  interphase-arrested  state. 

(4)  Dipeptides  bearing  destabilizing  N-terminal  res¬ 
idues  selectively  and  efficiently  inhibited  the  degrada¬ 
tion  of  N-end  rule  substrates  in  egg  extract.  In  partic¬ 
ular,  dipeptides  bearing  the  basic  (type  1)  primary 
destabilizing  N-terminal  residues  inhibited  the  degra¬ 
dation  of  N-end  rule  substrates  bearing  type  1  but  not 
type  2  N-terminal  residues.  A  converse  inhibition  pat¬ 
tern  was  observed  with  dipeptides  bearing  bulky  hy¬ 
drophobic  (type  2)  N-terminal  residues. 

(5)  The  onset  of  apoptosis-like  changes  in  egg  extract 
(fragmentation  of  the  added  sperm  nuclei  and  abrupt 
increase  of  the  DEVD-specific  protease  activity  (34, 
43))  was  found  to  be  delayed  by  ~1  h  in  the  presence  of 
dipeptide  Trp-Ala,  which  bears  a  t5q)e  2  destabilizing 
N-terminal  residue,  but  not  in  the  presence  of  dipep¬ 
tides  bearing  type  1  destabilizing  N-terminal  residues. 

The  N-end  rule  pathway  has  been  identified  in  all 
organisms  examined,  from  mammals  to  fungi  and  bac- 
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teria,  but  the  understanding  of  its  functions  remains 
incomplete  (6,  7),  The  first  clear  example  of  a  physio¬ 
logical  function  of  the  N-end  rule  pathway  is  its  re¬ 
cently  identified  role  in  controlling  the  import  of  pep¬ 
tides  in  S.  cereuisiae  through  the  degradation  of  the 
transcriptional  repressor  Cup9p  that  down-regulates  a 
peptide  transporter  (25,  46).  It  has  also  been  reported 
that  dipeptides  bearing  destabilizing  N-terminal  resi¬ 
dues  specifically  inhibit  differentiation  of  rat  pheochro- 
mocytoma  PC  12  cells  (47),  the  neurite  outgrowth  in 
amphibian  neuroepithelial  cells  (48),  and  limb  regen¬ 
eration  in  the  newt  (49),  suggesting  a  role  for  the  N-end 
rule  pathway  in  these  biological  processes.  Unfortu¬ 
nately,  none  of  these  studies  (47-49)  included  control 
experiments  to  verify  that  the  added  dipeptides  actu¬ 
ally  inhibited  degradation  of  a  reporter  N-end  rule 
substrate  in  the  target  cells.  Thus,  it  remains  to  be 
determined  whether  the  observed  biological  effects  of 
dipeptides  (47-49)  were  caused  by  specific  inhibition  of 
the  N-end  rule  pathway  in  the  target  cells  or  resulted 
from  other  effects  of  the  added  dipeptides. 

Apart  from  the  interest  in  determining  whether  the 
N-end  rule  pathway  is  present  in  germ  line  cells  such 
as  Xenopus  eggs,  one  reason  for  initiating  this  work 
was  the  opportxmity  to  test,  through  selective  inhibi¬ 
tion  of  the  N-end  rule  pathway  in  egg  extracts,  whether 
this  pathway  plays  a  role  in  apoptosis.  Several  lines  of 
evidence  suggest  a  role  for  the  Ub/proteasome  system, 
of  which  the  N-end  rule  pathway  is  a  part,  in  regulat¬ 
ing  apoptosis  (50-55).  In  particular,  cell-penetrating 
proteasome  inhibitors  were  reported  to  partially  pro¬ 
tect  nonproliferating  cells  such  as  thymocytes  and 
growth  factor-deprived  sympathetic  neurons  from  ap¬ 
optosis  (52,  53).  In  contrast,  the  same  proteasome  in¬ 
hibitors  were  reported  to  induce  apoptosis  in  mitoti- 
cally  active  cells  (51,  54).  An  apoptosis-like  process, 
induced  by  the  stress  of  starvation  and  mediated  by 
proteolysis,  was  also  identified  in  E.  coli  (56).  A  key 
regulator  of  E.  coli  apoptosis  is  a  short-lived  protein 
MazE,  which  is  degraded  by  ClpAP,  a  proteasome-like 
ATP-dependent  protease  that  targets  N-end  rule  sub¬ 
strates  in  E.  coli  (38).  These  and  other  considerations, 
including  the  possibility  that  cleavages  by  caspases  can 
produce  N-end  rule  substrates,  led  to  the  suggestion 
that  the  N-end  rule  pathway  may  function  in  the  con¬ 
trol  of  apoptosis  (7). 

Specific  dipeptides,  which  have  been  shown  to  act  as 
selective  inhibitors  of  the  N-end  rule  pathway  in  re¬ 
ticulocyte  extracts  (19,  27)  and  intact  S.  cerevisiae  cells 
(28),  were  foimd  to  be  largely  ineffective  with  several 
lines  of  mammalian  cells^.  Xenopus  egg  extracts,  in 
contrast  to  reticulocyte  extracts,  have  been  shown  to 
undergo  apoptosis-like  changes  (34,  36,  43),  hence  the 
choice  of  these  extracts  for  the  present  study.  We  found 
that  dipeptides  bearing  either  type  1  (Arg,  Lys)  or  t3^e 
3  (Ala)  destabilizing  N-terminal  residues  had  no  effect 


on  the  apoptotic  changes  in  the  extracts.  In  contrast, 
Trp-Ala,  which  bears  a  type  2  destabilizing  N-terminal 
residue,  inhibited  apoptosis  in  a  number  of  indepen¬ 
dent  experiments  with  different  preparations  of  egg 
extracts;  in  a  minority  of  tests  the  effect  was  not  ob¬ 
served.  Tyr-His,  a  dipeptide  bearing  another  type  2 
destabilizing  N-terminal  residue,  strongly  inhibited 
the  degradation  of  an  N-end  rule  substrate  such  as 
Phe-DHFR  in  the  extract,  but  was  significantly  less 
efficacious  than  Trp-Ala  in  its  effect  on  the  timing  of 
apoptotic  changes  in  the  same  extract.  Taken  together, 
the  results  of  experiments  with  dipeptides  as  inhibitors 
of  apoptosis  indicate  that  some  inhibitors  of  the  N-end 
rule  pathway,  notably  Trp-Ala,  can  cause  a  significant 
delay  of  apoptosis  in  the  extract.  A  definitive  test  and 
further  analysis  of  the  thus  suggested  function  of  the 
N-end  rule  pathway  in  apoptosis  will  require  mutants 
that  eliminate  this  pathway  in  a  multicellular  organ¬ 
ism  without  perturbing  the  rest  of  the  Ub  system. 
Construction  of  a  mouse  mutant  that  lacks  the  entire 
N-end  rule  pathway  (through  a  deletion  of  the  Ubrl 
gene  that  encodes  N-recognin,  the  main  recognition 
component  of  this  proteolytic  system)  is  imder  way^. 

In  summary,  we  established  the  existence  of  the 
N-end  rule  pathway  in  germ  line  cells  such  as  Xenopus 
eggs,  showed  that  dipeptides  are  efficacious  inhibitors 
of  the  N-end  rule  pathway  in  egg  extracts,  and  exam¬ 
ined  the  effects  of  dipeptides  on  apoptotic  changes  in 
these  extracts. 
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ABSTRACT  The  insufficient  selectivity  of  drugs  is  a  bane 
of  present-day  therapies.  This  problem  is  significant  for 
antibacterial  drugs,  difficult  for  antivirals,  and  utterly  un¬ 
solved  for  anticancer  drugs,  which  remain  ineffective  against 
major  cancers,  and  in  addition  cause  severe  side  effects.  The 
problem  may  be  solved  if  a  therapeutic  agent  could  have  a 
multitarget,  combinatorial  selectivity,  killing,  or  otherwise 
modifying,  a  cell  if  and  only  if  it  contains  a  predetermined  set 
of  molecular  targets  and  lacks  another  predetermined  set  of 
targets.  An  earlier  design  of  multitarget  drugs  [Varshavsky, 
A.  (1995)  Proc.  Natl.  Acad.  Sci.  USA  92,  3663-3667]  was 
confined  to  macromolecular  reagents  such  as  proteins,  with 
the  attendant  difficulties  of  intracellular  delivery  and  immu- 
nogenicity.  I  now  propose  a  solution  to  the  problem  of  drug 
selectivity  that  is  applicable  to  small  (^1  kDa)  drugs.  Two 
ideas,  codominant  interference  and  antieffectors,  should  allow 
a  therapeutic  regimen  to  possess  combinatorial  selectivity,  in 
which  the  number  of  positively  and  negatively  sensed  macro- 
molecular  targets  can  be  two,  three,  or  more.  The  nature  of  the 
effector  and  interference  moieties  in  a  multitarget  drug 
determines  its  use:  selective  killing  of  cancer  cells  or,  for 
example,  the  inhibition  of  a  neurotransmitter-inactivating 
enzyme  in  a  specific  subset  of  the  enzyme-containing  cells.  The 
in  vivo  effects  of  such  drugs  would  be  analogous  to  the 
outcomes  of  the  Boolean  operations  “and,”  “or,”  and  combi¬ 
nations  thereof.  1  discuss  the  logic  and  applications  of  the 
antieffector  and  interference/codominance  concepts,  and  the 
attendant  problem  of  pharmacokinetics. 


The  many  successes  of  pharmacology  (1,  2)  do  not  include  the 
problem  of  cancer.  Major  human  cancers  are  incurable  once 
they  have  metastasized.  A  few  relatively  rare  cancers,  such  as 
testicular  carcinoma  in  men,  Wilms'  kidney  tumor,  and  some 
leukemias  in  children,  can  often  be  cured  through  chemother- 
apvbut  require  cytotoxic  treatments  of  a  kind  that  cause  severe 
side  effects  and  are  themselves  carcinogenic  (2). 

The  main  reason  for  the  failure  of  cytotoxic  therapies  is  their 
insufficient  selectivity  for  tumors.  For  example,  treatments 
with  radiation  or  alkylating  agents  perturb  many  functions  that 
are  common  to  all  cells.  The  more  selective  cytotoxic  drugs,  for 
instance,  methotrexate,  taxol.  and  etoposide,  perturb  the 
functions  of  specific  macromolecular  targets  (dihydrofolate 
reductase,  microtubules,  and  topoisomerase  II),  but  these 
targets  are  present  in  both  normal  and  malignant  cells  (1,2). 
Hence  the  low  therapeutic  index  of  anticancer  drugs  and  their 
systemic  toxicity  at  clinically  relevant  doses.  Because  the 
mitotic  activity  of  cells  in  a  tumor  is  often  lower  than  the 
mitotic  activity  of  normal  cells  in  self-renewing  tissues  such  as 
the  bone  marrow  (3),  one  might  not  have  expected  these  drugs 
to  work  at  all— to  have  any  preference  for  the  killing  of  cancer 
cells.  That  such  preference  actually  exists  stems  in  part  from 
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the  fact  that  tumor  cells  are  often  perturbed  by  their  mutations 
into  stress-hypersensitive  states.  Consequently,  these  tumor 
cells  die  an  apoptotic  death  at  the  level  of  a  drug-imposed 
metabolic  stress  that  induces  apoptosis  in  some  but  not  in  most 
of  the  organism's  normal  cells  (3). 

With  some  cancers,  cytotoxic  therapies  are  ineffective  from  the 
beginning.  In  other  cases,  these  therapies  yield  a  partial,  some¬ 
times  clinically  complete,  but  almost  invariably  transient  remis¬ 
sion  of  a  cancer,  in  part  because  these  treatments  select  for  tumor 
cell  variants  that  retain  tumorigenicity  but  are  more  resistant  to 
either  apoptosis  per  se  or  a  drug  that  induces  apoptosis.  Because 
a  significant  increase  in  the  drug  or  radiation  dosage  is  precluded 
by  their  low  therapeutic  index,  these  therapies  become  ineffective 
when  resistant  clones  of  malignant  cells,  selected  by  a  drug 
treatment,  present  themselves  as  a  cancer  recurrence. 

The  failure  of  small  cytotoxic  drugs  to  produce  a  cure  for 
cancer  has  given  rise  to  other  strategies,  in  panicular  the  insightful 
suggestion  that  solid  tumors  can  be  targeted  by  selectively  inhib¬ 
iting  neovascularization,  a  process  that  these  tumors  depend  on 
for  growing  to  a  clinically  significant  size  (4).  Another  approach, 
immunotoxins.  involves  the  linking  of  a  toxin  to  a  ligand  such  as 
an  antibody  or  a  growth  factor  that  binds  to  a  target  on  the  surface 
of  tumor  cells  (5).  Among  the  limitations  of  present-day  immu¬ 
notoxins  is  their  incapacity,  on  entering  a  cell,  to  adjust  their 
toxicitv  in  response  to  the  intracellular  protein  composition.  Yet 
another  approach  is  to  enhance  the  ability  of  the  immune  system 
to  identify  and  selectively  destroy  tumor  cells.  The  current  revival 
of  this  strategy  holds  the  promise  of  a  rational  and  curative 
treatment  (6).  Given  the  complicated  regimens  and  unsolved 
problems  of  immunotherapies,  it  is  clear  that  this  and  other  recent 
approaches  (7,  8)  are  motivated  in  part  by  the  perception  that 
small-drug  pharmacology,  so  successful  against  bacterial  infec¬ 
tions,  is  unlikely  to  prove  effective  against  cancer.  In  contrast  to 
this  view,  the  premise  of  the  strategy  described  in  the  present 
work  is  that  small  anticancer  drugs  may  become  curative  and  free 
of  severe  side  effects  if  a  way  is  found  to  confer  on  these 
compounds  a  multitarget,  combinatorial  selectivity. 

Most  cancers  are  monoclonal:  cell  lineages  of  both  the  primary 
tumor  and  the  metastases  originate  from  a  single  founder  cell. 
This  cell  is  a  breakthrough  descendant  of  a  cell  lineage  that  has 
been  accumulating  mutations  for  some  time,  often  in  proximity 
to  other  neoplastic  but  still  nonmetastatic  cell  lineages  within  an 
indolent  proliferative  lesion  such  as  a  benign  tumor  (9, 10).  Given 
the  monoclonality  of  a  cancer,  cells  of  both  the  primary  tumor  and 
the  metastases  share  the  initial  mutations  that  yielded  the 
founder  cell,  even  if  these  cells  differ  at  other  loci  that  accumu¬ 
lated  mutations  in  the  course  of  the  later  tumor  progression. 
Some  of  the  early  mutations  are  in  genes  that  encode  tumor 
suppressors  (9,  11,  12).  In  most  cancers,  both  alleles  of  a  tumor 
suppressor  gene  are  inactivated,  sometimes  through  deletions 
that  encompass  the  gene  on  the  two  homologous  chromosomes. 
Thus,  a  monoclonal  cancer,  although  heterogeneous  genetically. 
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always  contains  a  set  of  founder  mutations  that  is  shared  by  all  of 
its  cells. 

A  drug  that  kills  a  cell  only  if  it  lacks  a  specific  macromo- 
lecular  target  would  distinguish  tumor  cells  from  many  other 
cells  of  an  organism,  provided  that  the  target  is  a  product  of 
a  gene  that  had  been  deleted  or  inactivated  in  this  cancer  at  the 
stage  of  its  founder  cell.  Such  a  drug  maybe  especially  selective 
against  cancers  that  lack  a  gene  for  a  ubiquitously  expressed 
tumor  suppressor,  for  example,  the  retinoblastoma  (Rb)  pro¬ 
tein  (11, 12).  An  example  of  the  negative-target  approach  is  the 
use  of  a  mutant  adenovirus  that  replicates  selectively  in  human 
cancer  cells  lacking  the  tumor  suppressor  p53  and  has  been 
shown  to  kill  these  cells  in  a  model  setting  (8). 

However,  other  tumor  suppressors  may  not  be  expressed  at 
comparable  levels  in  most  cells.  A  drug  that  kills  a  cell  if  it  lacks 
a  nonubiquitous  tumor  suppressor  would  be  toxic  to  a  subset 
of  normal  cells  as  well.  This  problem  could  be  reduced  through 
the  use  of  a  drug  that  is  toxic  only  if  a  cell  lacks  two  specific 
macromolecules,  termed  negative  targets.  Two  judiciously 
chosen  negative  targets  may,  together,  suffice  to  distinguish  all 
of  the  cancer  cells  from  all  of  the  organism’s  normal  cells.  If 
they  do  not,  a  third  negative  target  that  had  been  deleted  or 
rendered  defective  in  a  given  cancer  can  be  employed  as  well. 
This  strategy  requires  a  drug  that  possesses  the  ability  to  kill 
a  cell  if  it  lacks  two  or  more  of  the  predetermined  targets,  but 
would  spare  a  cell  containing  either  one  of  these  targets. 

Other  changes  in  a  founder  cell  may  involve  a  missense 
mutation,  an  amplification  and  overexpression,  an  ectopic 
expression,  or  a  translocation/fusion  of  a  specific  protoonco¬ 
gene  such  as,  for  example,  Ras  or  Myc  (9,  10,  13).  A  single 
oncoprotein  may  not  be  a  unique  enough  target  by  itself,  for 
reasons  similar  to  those  described  above  in  the  context  of 
negative  targets.  However,  a  combination  of  two  or  more 
distinct  oncoproteins  that  were  either  mutated  or  inappropri¬ 
ately  expressed  in  the  founder  cell  can  be  employed  to 
formulate  the  unique  multiprotein  signature  of  a  specific 
cancer  that  comprises  both  positive  and  negative  targets. 

These  considerations  suggest  that  a  conditionally  cytotoxic 
therapeutic  regimen  that  is  exquisitely  specific  for  a  given 
cancer,  and  therefore  would  eliminate  it  without  significant 
side  effects,  must  possess,  in  most  cases,  a  multitarget,  com¬ 
binatorial  (positive/negative)  selectivity  of  the  kind  defined 
above.  Conversely,  even  an  informed  choice  of  the  molecular 
target  for  a  single-target  drug  may  not  suffice  to  define 
unambiguously  the  cell  type  to  be  eliminated.  Note  that  simply 
combining  two  single-target  drugs  against  two  different  targets 
in  a  multidrug  regimen  would  not  yield  a  multitarget  selectiv¬ 
ity,  because  the  two  drugs  together  would  perturb  not  only  cells 
containing  both  targets  but  also  cells  containing  either  one  of 
the  targets. 

Although  the  problem  of  insufficient  selectivity  is  not  as 
acute  with  noncytotoxic  drugs,  it  is  relevant  to  them  as  well. 
Among  the  multitude  of  examples  are  side  effects  of  therapies 
with  antipsychotic  agents.  The  side  effects  are  caused  in  part 
by  the  insufficient  molecular  specificity  of  drugs,  which  is 
exemplified  by  the  ability  of  antidepressants  that  inhibit  mono¬ 
amine  oxidase  to  perturb  other  proteins  as  well  (2),  This 
difficulty  will  continue  to  abate  with  the  development  of  more 
specific  single-target  inhibitors.  But  an  entirely  distinct,  major, 
and  unsolved  problem  with  inhibitors  as  drugs  is  the  current 
impossibility  of  restricting  their  action  to  a  specific  subset  of 
cells  among  those  that  contain  the  inhibitor’s  target.  For 
example,  even  an  exquisitely  specific  inhibitor  of  a  clinically 
relevant  enzyme  is  likely  to  have  side  effects,  because  the 
target  enzyme  is  present,  in  most  cases,  not  only  in  the  cells 
where  its  inhibition  is  clinically  beneficial  but  also  in  the  cells 
where  its  inhibition  is  physiologically  inappropriate.  The 
present  work  describes  a  possible  solution  of  this  problem. 

A  previously  proposed  approach  to  designing  multitarget  drugs 
utilized  degradation  signals  (degrons)  and  analogous  signals  that 


exhibit  the  property  of  codominance  (14,  15).  As  a  result,  this 
strategy  was  confined  to  macromolecular  reagents  such  as  pro¬ 
teins,  with  the  attendant  problems  of  immunogenicity,  extrava¬ 
sation,  and  intracellular  delivery.  The  latter  difficulty  is  especially 
significant,  because  either  gene-therapy  or  direct-delivery  meth¬ 
ods  for  introducing  large  molecules  into  cells  work  reasonably 
well  with  cells  in  culture  but  are  still  inefficient  with  cells  in  an 
intact  organism.  The  challenge,  then,  is  to  attain  a  multitarget, 
combinatorial  selectivity  in  the  setting  of  small  (<1  kDa)  drugs, 
where  the  immunogenicity  and  delivery  problems  are  less  severe. 
A  solution,  described  below,  invokes  a  modification  of  the  earlier 
idea  of  codominant  interference  (14)  in  conjunction  with  the  new 
concept  of  antieffectors.  This  solution  is  applicable  to  either 
cytotoxic  or  noncytotoxic  therapies. 

RESULTS  AND  DISCUSSION 

Multitarget  Compounds  Specific  for  Negative  Targets:  The 
Concept  of  Codominant  Interference.  Previous  work  (14) 
suggested  that  the  property  of  codominance,  characteristic  of 
degradation  signals  (degrons)  and  many  other  signals  in 
biopolymers,  can  be  employed  to  design  protein-based  re¬ 
agents  that  possess  multitarget,  combinatorial  selectivity  of  the 
kind  defined  above.  Codominance  refers  to  the  ability  of  two 
or  more  signals  in  the  same  molecule  to  function  indepen¬ 
dently  and  not  to  interfere  with  each  other.  It  is  shown  below 
that  a  distinct  version  of  the  interference/codominance  (IC) 
concept  (14)  is  applicable  to  small  (<1  kDa)  compounds. 
Consider  a  reagent  containing  three  small  moieties  a,  b,  and 
i,  which  can  bind,  respectively,  to  three  macromolecular  targets 
A,  B,  and  I.  Because  the  moieties  a,  b,  and  i  are  much  smaller 
than  the  macromolecules  A,  B,  and  I,  it  should  be  possible  to 
arrange  these  moieties  in  the  compound  abi  in  such  a  way  that 
the  binding  of  A  or  B  to  a  or  b  would  preclude,  through  steric 
hindrance,  the  binding  of  moiety  i  to  I  (Fig.  1). 

That  the  interactions  of  a  small  bipartite  compound  with  its 
two  macromolecular  ligands  can  be  made  mutually  exclusive  is 
expected  from  basic  physicochemical  considerations.  This  has 
also  been  demonstrated  directly,  in  a  context  unrelated  to  the 
present  discussion.  When  lisinopril,  an  inhibitor  of  the  angio¬ 
tensin-converting  enzyme  (ACE),  was  connected,  via  an  11- 
atom  linker,  to  the  biotin  moiety,  the  resulting  bivalent  com¬ 
pound  could  bind  to  and  inhibit  ACE  in  the  absence  but  not 
in  the  presence  of  the  biotin-binding  protein  streptavidin  (16). 
Small  compounds  comprising  two  linker-connected  moieties 
such  as  cyclosporin  and  FK506,  which  are  specific  for  two 
macromolecular  targets,  have  previously  been  employed  as  in 
vivo  dimerization  devices,  making  it  possible  to  bring  together 
two  otherwise  noninteracting  proteins  (17).  However,  the 
linker  moiety  of  these  bipartite  compounds  was  chosen  to  allow 
simultaneous  interactions  with  the  targets,  in  contrast  to  the 
mutual  exclusivity  of  interactions  in  the  IC  approach  (Fig.  1). 

If  the  moiety  i  is  an  inhibitor  of  an  essential  cellular  enzyme  I, 
the  presence  of  the  macromolecular  targets  A  or  B  in  a  cell  would 
reduce  the  inhibition  of  enzyme  I  by  abi,  because  the  complexes 
abi-A  and  abi-B  would  be  mutually  exclusive  with  the  complex 
abi-I  (Fig.  1).  Note  that  A  and  B  are  codominant  in  their  ability 
to  reduce  the  inhibition  of  I  by  abi.  Therefore,  there  is,  formally, 
no  limit  on  the  number  of  a,  b-like  competition  modules  that  can 
be  used  to  construct  an  abi-like  compound  whose  activity  is 
sensitive  to  the  presence  of  several  distinct  macromolecules, 
called  negative  targets.  The  fractional  occupancy  of  the  macro¬ 
molecular  targets  A,  B,  and  I  by  the  a,  b,  and  i  moieties  of  abi 
would  be  determined  in  part  by  the  targets’  intracellular  concen¬ 
trations.  There  are  also  specific  pharmacokinetic  constraints  on 
the  selectivity  of  abi,  an  issue  discussed  below. 

A  tabulation  of  the  relative  toxicities  of  abi  for  cells  that  either 
lack  or  contain  the  negative  targets  A  and  B  is  shown  in  Fig.  2.  It 
can  be  seen  that  abi  would  be  relatively  nontoxic  to  three  of  the 
four  ceU  types  and  toxic  exclusively  to  the  cells  that  lack  both  A 
and  B  (Fig.  2).  Thus,  the  IC  concept  allows  the  construction  of 
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Fig.  1.  The  interference/codomi- 
nance  concept.  (1^)  A  small  (<1 
kDa)  moiety  i  is  linked  to  two  other 
small  moieties,  a  and  b.  The  moieties 
a,  b,  and  i  are  ligands  of  the  macro¬ 
molecules  A,  B,  and  I,  respectively. 
The  distances  between  a,  b,  and  i,  and 
their  mutual  arrangement  in  the  tri¬ 
partite  compound  abi  are  such  that 
the  interaction  i-I  is  mutually  exclu- 

...  ■  «  o  -r-  11  either  the  interaction  a-A  or 

the  interaction  b-B.  Specifically,  the  macromolecule  I  in  its  complex  with  the  small  moiety  i  would  sterically  clash  with  the  macromolecules  A  or 
B  if  either  A  or  B  is  positioned  to  bind  a  or  b  of  abi  (5  and  6),  In  the  diagram,  the  interactions  a-A  and  b-B  are  also  mutually  exclusive,  but  this 
constraint  is  not  essential.  (Note  that  if  the  interactions  a-A  and  b-B  were  mutually  nonexclusive,  the  compound  abi  would  promote  the  binding 
of  A  to  B.)  The  codominance  aspect  of  the  IC  concept  allows  this  design  to  accommodate  more  than  two  of  the  a,  b-like  competition  modules  (not 
shown).  In  Figs.  2-4,  the  i  moiety  is  an  inhibitor  of  an  essential  enzyme  I.  In  fact,  the  only  constraint  on  the  identities  of  i  and  I  is  the  requirement 
for  an  i-I  interaction  to  alter  the  functional  activity  of  a  macromolecule  I.  In  other  words,  the  choice  of  I  is  determined  by  the  intended  effect  of 
the  (unsequestered)  compound  abi  (see  the  main  text). 


small  compounds  that  exhibit  multitarget  selectivity  for  negative 
targets.  One  more  idea  is  required  to  accomplish  the  same  for 
positive  targets  and  to  link  the  two  strategies. 

Multitarget  Compounds  Specific  for  Positive  Targets:  The 
Concept  of  Antieffectors.  Consider  a  small  compound  i*  that 
binds  to  enzyme  I  in  the  vicinity  of  its  active  site,  but  does  not 
perturb  the  catalytic  activity  of  I  toward  its  physiological 
substrates  (Fig.  3B).  Suppose  further  that  the  compound  i*, 
termed  an  antiinhibitor,  was  designed  to  interfere,  sterically, 
with  the  binding  of  an  inhibitor  i  to  the  enzyme’s  active  site 
while  at  the  same  time  allowing  the  binding  of  physiological 
substrates.  One  way  to  achieve  this  would  be  to  endow  either 
i,  or  i*,  or  both  of  them  with  a  set  of  chemical  groups,  termed 
a  “bump,”  whose  function  is  to  produce  steric  hindrance  that 
makes  the  interactions  i-I  and  i*-I  mutually  exclusive  (Fig.  3  A 
and  B).  A  moiety  that  functions  as  a  bump  may  also  be 
designed  to  enhance  specific  binding  of  either  the  inhibitor  i  or 
the  antiinhibitor  i*  to  enzyme  I,  but  this  consideration  is 
secondary  to  the  bump’s  essential  purpose. 

In  one  application  of  the  antiinhibitor  i*,  it  is  linked  to  c,  a 
small  moiety  that  can  bind  to  a  macromolecular  target  C.  The 
mutual  arrangement  of  i*  and  c  in  ci*  is  such  that  the 
interactions  of  ci*  with  I  and  C  are  mutually  exclusive.  In  the 
absence  of  C,  ci*  would  compete  with  abi  for  the  binding  to 


enzyme  I,  thereby  partially  protecting  I  from  inhibition  by  abi 
(Fig.  3C).  This  protective  effect  of  ci*  would  be  suppressed  in 
the  presence  of  its  macromolecular  target  C  (Fig.  3D).  In  the 
logic  of  codominance,  discussed  above  in  the  context  of 
negative  targets,  a  compound  bearing  an  antiinhibitor  moiety 
i*  could  contain  more  than  one  c-like  moiety.  For  example,  a 
compound  edi*,  whose  moieties  c  and  d  can  bind,  respectively, 
to  the  macromolecules  C  and  D,  would  reduce  the  inhibition 
of  enzyme  I  by  abi  only  in  the  absence  of  both  C  and  D.  Yet 
another  pattern  of  multitarget  selectivity  can  be  produced,  in 
this  context,  by  separating  the  competition  moieties  c  and  d. 
The  resulting  ci*  and  di*,  if  administered  together  with  abi, 
would  reduce  the  inhibition  of  enzyme  I  by  abi  if  just  one  of 
the  targets,  C  or  D,  is  absent.  As  shown  below,  the  key  merit 
of  the  antiinhibitor  idea  is  that  it  allows  the  effect  of  a  single 
inhibitor  i  to  be  modulated  by  both  negative  and  positive 
macromolecular  targets. 

On  the  Difference  Between  Antieffectors  and  Antagonists. 

The  distinctions  between  substrates,  inhibitors,  and  antiinhibi¬ 
tors  were  described  above.  The  concept  of  antieffectors  is  also 
relevant  to  ligand-binding  biopolymers  other  than  enzymes. 
For  example,  an  agonist  binds  to  its  receptor  and  evokes  a 
physiological  response.  An  antagonist  binds  to  a  site  of  the 
receptor  that  overlaps  with  the  agonist-binding  site,  does  not 

FIg.  2.  Multitarget  selectivity  of  a 
compound  that  utilizes  interference/ 
codominance.  This  diagram  tabulates 
the  relative  toxicities  of  the  compound 
abi  for  cells  that  either  lack  or  contain 
macromolecular  targets  A  and  B.  The  i 
moiety  of  the  compound  abi  (see  the 
legend  to  Fig.  1)  inhibits  an  essential 
enzyme  I.  The  interaction  i-I  is  mutu¬ 
ally  exclusive  with  the  interaction  a-A 
and  the  interaction  b-B,  the  macromol¬ 
ecules  A  and  B  being  negative  targets  of 
abi.  It  is  assumed  that  concentrations 
of  the  targets  A  and  B  in  cells  that 
contain  at  least  one  of  them  signifi¬ 
cantly  exceed  the  concentration  of  I 
(see  the  main  text).  In  A"^  B+,  A"^  B“, 
and  A“  B"*"  cells,  the  enzyme  I  would 
be  at  most  partially  inhibited  by  the  i 
moiety  of  abi,  because  of  the  compet¬ 
ing  interactions  of  abi  with  A  and/or  B. 
By  contrast,  in  A“  B“  cells,  the  bulk  of 
abi  molecules  would  be  available  for 
interaction  with  I,  resulting  in  the  se¬ 
lective  toxicity  of  abi  to  these  cells.  The 
selectivity  pattern  of  abi  requires  that 
certain  pharmacokinetic  conditions 
are  met  as  well  (see  the  main  text). 
Note  that  the  physiological  effects  and 
the  uses  of  abi-type  compounds  are  not 
confined  to  cytotoxic  regimens. 


Biochemistry:  Varshavsky 


Proc,  Natl  Acad,  ScL  USA  95  (1998)  2097 


Fig.  3.  The  antieffector  concept.  The  particular  case  illustrated 
here  is  that  of  an  antiinhibitor  i*,  defined  as  a  compound  whose 
binding  to  an  enzyme  I  does  not  inhibit  the  activity  of  I  but  does 
preclude  the  inhibition  of  I  by  an  inhibitor  i.  In  this  example,  the 
antiinhibitor  i*  has  the  following  properties.  First,  it  binds  to  I  in  the 
vicinity  of  the  Fs  active  site,  but  does  not  perturb  the  catalytic  activity 
of  I  toward  its  physiological  substrates.  Second,  !*,  in  its  bound  state, 
sterically  interferes  with  the  interaction  between  I  and  its  inhibitor  i. 
To  implement  the  second  condition,  either  i,  or  !♦,  or  both  of  them 
bear  additional  moiety,  a  '‘bump,”  denoted  by  the  rectangular  pro¬ 
trusions  in  i  and  i*.  The  function  of  the  bump  is  to  produce  steric 
hindrance  that  makes  the  interactions  i-I  and  i*-I  mutually  exclusive. 
The  inhibitor  i  described  here  and  in  the  main  text  is  a  competitive 
inhibitor,  but  i  could  be  a  noncompetitive  inhibitor  as  well.  An 
allosteric  antiinhibitor,  which  functions  through  binding  to  a  remote 
site  of  enzyme  I,  is  yet  another  possibility.  (A)  A  complex  of  the 
enzyme  I  with  its  inhibitor  i.  (B)  A  complex  of  I  with  its  antiinhibitor 
i*.  Note  that  the  bumps  of  the  bound  i  and  i*  spatially  overlap.  (C)  The 
antiinhibitor  i*  is  linked  to  c,  a  small  moiety  that  can  bind  to  a 
macromolecular  target  C.  The  design  of  ci*  is  analogous  to  abi  (Figs. 
1  and  2),  in  that  the  interactions  of  ci*  with  I  and  C  are  mutually 
exclusive.  When  ci*  is  bound  to  I,  the  inhibitor  i,  shown  here  as  a  part 
of  the  compound  abi  (Figs.  1  and  2),  is  unable  to  bind  to  and  inhibit 
the  enzyme  I.  (D)  A  complex  between  ci*  and  its  macromolecular 
target  C.  This  complex,  being  mutually  exclusive  with  the  ci*-I 
complex,  reduces  the  ability  of  ci*  to  protect  the  enzyme  I  from 
inhibition  by  abi. 

activate  the  receptor,  and  in  addition  precludes  the  binding  of 
agonist  (1,  2).  By  contrast,  an  antieffector,  which  would  be 
called,  in  this  setting,  an  antiantagonist,  binds  to  the  receptor 
in  such  a  way  that  the  receptor  can  still  bind,  and  respond  to, 
the  agonist,  but  cannot  bind  the  antagonist.  To  this  end,  either 
an  antagonist,  or  an  antiantagonist,  or  both  must  possess  a 
bump,  an  additional  moiety  described  above  in  the  context  of 


enzymes  and  antiinhibitors  (Fig.  3).  The  idea  of  antieffectors 
is  thus  distinct  from  that  of  antagonists  or  inhibitors  and  is  new, 
to  the  best  of  my  knowledge.  I  am  also  not  aware  of  a  naturally 
occurring  pair  of  compounds  that  satisfy  the  definition  of 
effectors/antieffectors  in  a  physiologically  relevant  setting. 

Interference/Codominance  and  Antieffector  in  a  Regimen 
That  Possesses  Combinatorial  Selectivity.  Applying  the  IC  and 
antieffector  concepts  together  yields  regimens  that  posses  true 
combinatorial  selectivity,  i.e.,  sensitivity  to  both  negative  and 
positive  targets.  Consider  a  population  of  cells  that  either  contain 
or  lack  the  macromolecular  targets  A,  B,  and  C.  Our  aim  is  to 
devise  a  treatment  that  would  be  toxic  to  cells  that  lack  A  and  B 
but  contain  C  (A"  B"  cells)  and  relatively  nontoxic  to  the 
other  cell  types  (Fig.  4).  A  regimen  of  two  compounds,  abi  (Fig. 
3C)  and  ci*  (Fig.  3Z)),  has  the  requisite  selectivity,  as  shown  in  Fig. 

4,  which  tabulates  the  outcomes  of  this  treatment  for  different  cell 
types.  Specifically,  in  the  absence  of  C  (four  cell  types  out  of 
eight),  the  antiinhibitor-containing  ci*  would  compete  with  the 
inhibitor-containing  abi  for  binding  to  the  essential  enzyme  I, 
thereby  reducing  the  inhibition  of  I  by  abi,  and  hence  reducing  the 
toxicity  of  abi.  In  three  other  cell  types,  whose  common  property 
is  the  presence  of  C  and  at  least  one  of  the  other  two  targets,  A 
or  B,  the  antiinhibitor-containing  ci*  would  be  largely  seques¬ 
tered  by  C,  and  hence  inactive,  but  the  inhibitor-containing  abi 
would  be  sequestered  as  well,  by  either  A  or  B.  In  only  one  type 
of  cells,  those  that  lack  A  and  B  but  contain  C  (A”  B“  cells), 
is  the  inhibitor  abi  fully  available  for  interaction  with  I,  resultmg 
in  higher  toxicity  (Fig.  4).  The  differences  in  the  toxicity  of  abi  to 
different  cell  types  would  be  determined  by  the  relative  stoichio¬ 
metries  and  absolute  concentrations  of  the  cellular  targets  in¬ 
volved  (A,  B,  C,  and  I),  by  the  affinities  of  the  moieties  a,  b,  c,  i, 
and  i*  for  these  targets,  and  by  the  pharmacokinetic  properties  of 
abi  and  ci*. 

Straightforward  variations  of  the  abi  and  ci*  designs  that 
utilize  IC  and  the  properties  of  antiinhibitors  would  allow 
selective  targeting  of  any  one  of  the  eight  cell  types  that  differ 
by  the  presence  or  absence  of  three  macromolecular  targets. 
Moreover,  there  is  no  formal  limit  on  the  total  number  of 
negative  and/or  positive  targets  that  can  be  simultaneously 
sensed  by  regimens  that  employ  abi-  and  ci*-type  compounds 
bearing  multiple  interference  moieties.  Note  that  the  cell  type 
selectivity  of  regimens  such  as  abi  +  ci*  (Fig.  4)  is  analogous 
to  the  outcomes  of  the  Boolean  operations  ‘‘and,”  “or,”  and 
combinations  thereof. 

Stoichiometries,  Affinities,  and  Pharmacokinetics,  The  se¬ 
lectivity  of  the  proposed  compounds  results  from  mutually 
exclusive,  competing  interactions  between  individual  moieties 
of  these  compounds  and  their  macromolecular  targets  (Figs. 
2-4);  hence,  the  importance  of  the  targets’  intracellular  con¬ 
centrations,  relative  to  each  other  and  the  enzyme  I,  which  is 
inhibited  by  an  effector  moiety  of  these  drugs.  The  choice  of 
I  is  not  confined  to  essential  enzymes.  The  target  I  could  be, 
for  instance,  a  DNA-binding  repressor  of  terminal  differenti¬ 
ation,  a  repressor  of  apoptosis,  or,  in  the  example  of  a 
noncytotoxic  therapy,  a  neurotransmitter-inactivating  enzyme. 
In  other  words,  the  choice  of  I  is  determined  by  the  intended 
effect  of  the  (unsequestered)  compound  abi. 

The  sequestration  of  abi-  and  ci*-type  compounds  by  their 
macromolecular  ligands  A,  B,  and  C  serves  to  prevent  their 
binding  to  the  enzyme  I  (Figs.  3  and  4).  Therefore,  in  schemes 
of  the  type  considered  above,  the  molar  concentration  of  I 
should  be  significantly  (if  possible,  considerably)  lower  than 
the  molar  concentrations  of  A,  B,  and  C.  In  addition,  the 
concentration  of  ci*  in  a  ci*  +  abi  regimen  should  significantly 
exceed  that  of  abi,  because  ci*  is  the  sole  obstacle  to  the 
inhibition  of  enzyme  I  by  abi  in  the  A"*  B”  C“  ceils  (Fig.  4). 
It  is  assumed,  furthermore,  that  the  total  intracellular  con¬ 
centrations  of  abi  and  ci*,  bound  and  unbound,  would  remain 
significantly  below  the  concentrations  of  the  interference 
targets  A,  B,  and  C.  The  affinities  of  A,  B,  C,  and  I  for  the 
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Fig.  4.  Combinatorial  (positive/ 
negative)  selectivity  of  a  regimen  that 
utilizes  interference/codominance 
and  antiinhibitor.  The  diagram  tabu¬ 
lates  the  relative  toxicities  of  the  com¬ 
pound  abi  in  the  setting  of  a  two- 
compound  treatment  of  cells  that  ei¬ 
ther  lack  or  contain  the 
macromolecular  targets  A,  B,  and  C. 
The  inhibitor-containing  compound 
abi  is  described  in  the  main  text  and 
Figs.  1,  2,  and  3C.  The  antiinhibitor- 
containing  compound  ci*  is  described 
in  the  main  text  and  Fig.  3.  A  and  B  are 
negative  targets,  in  that  they  reduce, 
through  the  binding  to  the  moieties  a 
and  b  of  abi,  the  inhibition  of  an 
essential  enzyme  I  by  abi.  C  is  a 
positive  target,  in  that  it  reduces, 
through  the  binding  to  the  c  moiety  of 
ci*,  the  binding  of  ci*  to  enzyme  L 
This  results  in  a  larger  fraction  of 
enzyme  I  available  for  the  inhibition 
by  abi.  It  is  assumed  that  the  concen¬ 
trations  of  A,  B,  and  C  in  cells  that 
contain  them  significantly  exceed  the 
concentration  of  I  (see  the  main  text). 
In  all  of  the  cell  types  except  A“  B“ 
C**",  the  enzyme  I  would  be  at  most 
partially  inhibited  by  the  i  moiety  of 
abi,  because  of  the  competing  inter¬ 
actions  of  abi  with  A  and/or  B,  and 
also  because  in  C“  cells  a  fraction  of 
enzyme  I  would  be  protected  from  the 
inhibition  by  abi  through  the  interac¬ 
tion  of  I  with  the  antiinhibitor  moiety 
i*  of  ci*.  By  contrast,  in  A~  B”  C'*' 
cells,  the  antiinhibitor-containing  ci* 
would  be  sequestered  by  C,  whereas 
abi  would  not  be  sequestered  by  A  or 
B,  which  are  absent  from  these  cells. 
As  a  result,  a  larger  fraction  of  the  inhibitor-containing  abi  molecules  would  be  available  for  the  interaction  with  I,  resulting  in  the  selective  toxicity 
of  abi  to  A”  B”  cells.  The  selectivity  pattern  of  abi  requires  that  certain  pharmacokinetic  conditions  are  met  as  well  (see  the  main  text).  Note 
that  the  physiological  effects  and  the  uses  of  abi-type  compounds  are  not  confined  to  cytotoxic  regimens. 


Cell  type 

Major  complexes 

Minor  complexes 

Toxicity 

|a  j  ^ 

^05 

Low 

A" 

Low 

A+B"C+ 

Low 

A^B'^C' 

mm 

Low 

A^B’C" 

mm 

Low 

A"  B‘ 

A'  B^C" 

mm 

A"  B"  C 

Low 

respective  moieties  of  the  compounds  abi  and  ci*,  and  also  the 
targets’  intracellular  locations  are  among  the  independent 
parameters  that  can  be  varied  in  designing  these  compounds. 

Yet  another,  and  major,  constraint  on  the  nature  and 
pharmacokinetics  of  multitarget  drugs  stems  from  the  fact  that 
the  selectivity  patterns  described  above  (Figs.  2  and  4)  may  not 
be  observed  under  equilibrium  conditions,  where  the  influx  of 
a  drug  into  cells  equals  its  outflux.  To  illustrate  one  clear 
difficulty,  let  us  oversimplify  and  suppose  that  an  abi-type 
compound  is  metabolically  inert,  in  addition  to  being  capable 
of  crossing  the  plasma  membranes  and  other  lipid  bilayers.  If 
abi  is  initially  outside  the  cells,  and  if  the  extracellular  pool  of 
abi  (e.g.,  in  the  blood  plasma)  is  large  enough,  it  can  be  shown 
that  the  subsequently  reached  equilibrium  state  would  be 
characterized  by  equal  concentrations  of  the  free  abi  in  both 
the  A***  B"**  and  A”  B“  cells,  thereby  resulting  in  the  equal 
occupancies  of  the  enzyme  I  by  abi  in  these  cells,  contrary  to 
the  pattern  illustrated  in  Fig.  2. 

By  contrast,  the  selectivity  patterns  of  Figs.  2  and  4  would  be 
observed  during  the  initial  influx  of  drugs  into  cells.  Thus,  one 
requirement  for  the  multitarget  selectivity  of  abi-  and  ci*-type 
regimens  is  the  avoidance  of  equilibrium  states  such  as  the  one 
described  above.  This  and  related  considerations  indicate  that 
despite  the  logical  simplicity  of  the  proposed  designs,  their 
implementation  will  have  to  address  pharmacokinetic  prob¬ 
lems  that  do  not  necessarily  arise  with  single-target  drugs. 

Selection  of  Targets  and  Construction  of  Multitarget  Drugs. 
Appropriate  macromolecular  targets  of  the  A-C  class  (Figs.  3  and 
4)  are  suggested  by  the  protein  composition  of  the  tumor  cells  to 


be  eliminated.  The  choice  of  an  essential  intracellular  enzyme  I 
(Figs.  2-4)  is  determined  by  the  presence  of  I  at  least  in  tumor  cells, 
its  physiological  concentration,  and  the  feasibility  of  an  efficacious 
inhibitor  of  I.  Among  potentially  suitable  enzymes  for  which 
cell-penetrating  inhibitors  already  exist  is  dihydrofolate  reductase. 
Its  high-affinity  inhibitors  include  methotrexate,  which  enters  cells 
through  carrier-mediated  pathways,  and  the  more  lipophilic  tri- 
metrexate,  which  can  enter  cells  by  diffusing  through  lipid  bilayers 
(18). 

Each  of  the  modules  in  the  abi-  and  ci*-type  compounds  (Figs. 
1-4)  would  bind  its  macromolecular  target  in  the  absence  of  the 
other  modules.  Therefore,  the  cytotoxic  i-type  modules  of  IC- 
based  compounds  would  be  similar  to  the  stand-alone  cytotoxic 
drugs  of  today.  By  contrast,  the  interference  modules  of  these 
compounds,  i.e.,  their  a-,  b-,  and  c-type  moieties  (Figs.  1-3),  are 
supposed  to  bind  to  their  macromolecular  targets  but  preferably 
not  impair  them  functionally.  This  specification  of  a  competition 
module  simplifies  its  design  in  comparison  to  that  of  inhdbitors, 
because  many  sites  on  the  target’s  surface,  and  not  just  the  active 
site,  would  be  acceptable. 

Antiinhibitors  (Fig.  3)  are  a  new  class  of  physiologically  active 
compounds.  The  opportunities  and  problems  of  their  design  are 
similar  to  those  for  the  interference  moieties  a-c  (Figs.  1-3),  but 
there  are  two  other  difficulties  as  well.  First,  the  antiinhibitor 
moiety  i*  must  bind  in  the  vicinity  of,  but  not  at,  the  active  site 
of  enzyme  I.  Second,  the  moiety  i*  must  also  bear  chemical 
groups  (a  bump)  whose  function  is  to  preclude,  through  steric 
hindrance,  the  binding  of  the  inhibitor  moiety  i  to  the  enzyme  I 
(Fig.  3). 
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A  substrate-binding  cleft  is  not  the  only  indentation  in  a 
folded  protein  molecule.  The  other  clefts  tend  to  be  smaller, 
but  they  are  present  as  well  (19),  and  some  of  them  may  be 
located  next  to  the  enzyme's  active  site  (Fig.  ,?).  In  addition, 
even  relatively  flat  molecular  surfaces  can  be.  in  principle,  the 
sites  of  high-affinity  interactions  with  small  ligands  (20).  These 
optimistic  comments  notwithstanding,  the  development  of 
antiinhibitors  is  certain  to  be  a  complex  undertaking.  As 
discussed  above,  the  pharmacokinetic  aspects  of  the  proposed 
designs  are  also  complex.  Yet  simplicity  is  good  only  if  it  works. 
Single-target  anticancer  drugs  remain  unsatisfactory,  in  spite 
of  decades  of  immense  effort.  It  may  therefore  be  wise  to 
attempt  a  more  complex  but  also  more  effective  solution. 

A  recent  advance  in  drug  design,  termed  SAR  by  NMR 
(structure-activity  relationships  by  nuclear  magnetic  reso¬ 
nance).  provides  an  especially  promising  route  to  constructing 
ligands  for  specific  regions  of  a  protein  molecule  (21).  In  this 
approach,  a  library  of small  molecules  is  screened  for  binding 
to  an  ^-^N-labeled  protein  by  using  NMR,  which  can  detect 
weak  interactions,  and  in  addition  assigns  them  to  specific 
nitrogens  of  a  protein,  thereby  identifying  the  site  of  binding. 
Finding  two  small  compounds  that  bind  to  adjacent  patches  of 
the  target  protein  molecule  and  covalently  linking  these  com¬ 
pounds  produces  a  higher-affinity  ligand.  This  powerful  strat¬ 
egy  (21).  which  already  yielded  tightly  binding  ligands  of 
specific  proteins,  may  prove  sufficient  for  constructing  the  a-c 
competition  modules  and  the  i*  antiinhibitor  modules  of  the 
proposed  designs  (Figs.  1-4). 

Noncytotoxic  Multitarget  Drugs,  Many  useful  drugs  are  the 
inhibitors  of  intracellular  enzymes  that  are  not  essential  for  cell 
viability  (2).  The  problem  of  insufficient  selectivity  is  relevant 
to  these  drugs  as  well.  For  example,  even  an  exquisitely  specific 
inhibitor  of  a  clinically  relevant  enzyme  is  likely  to  have 
significant  side  effects,  because  the  target  enzyme  is  present, 
in"  most  cases,  not  only  in  the  cells  where  its  inhibition  is 
clinically  beneficial  but  also  in  the  cells  where  its  inhibition  is 
physiologically  inappropriate.  The  logic  of  abi-type  inhibitors 
(Figs.  1  and  2)  and  ci*-type  antiinhibitors  (Figs.  3  and  4)  is 
applicable  in  these  settings,  because  an  informed  choice  of  the 
competition  moieties  a.  b,  and  c  would  sharpen  up  the  cell 
selectivity  of  the  inhibitor  moiety  i  in  the  way  described  above 
for  cvtotoxic  drugs  (Fig.  4),  resulting  in  the  inhibition  of  the 
(nonessential)  enzyme  I  in  a  predetermined  subset  of  the 
enzvme-containing  cells.  Note  that  the  same  considerations 
apply  to  extracellular  settings  as  well.  The  examples  above  are 
but  a  glimpse  of  the  drug-engineering  vistas  that  are  opened 
up  by  the  IC  and  antieffector  concepts.  At  the  same  time,  there 
are  significant  pharmacokinetic  constraints  on  the  properties 
of  the  proposed  drugs,  as  discussed  above.  These  constraints 
are  likely  to  complicate  the  implementation  of  the  IC/ 
antieffector  strategies. 

The  Problem  of  Drug  Resistance.  With  small  anticancer 
drugs  that  are  in  use  today,  the  macromolecular  target  of  a 
drug  serves  two  distinct  functions.  First,  the  target  is  a 
cell-selectivitv  determinant  that  may  bias  the  treatment  against 
tumor  cells.  Second,  the  target  is  also  a  device  whose  inhibition 
by  the  drug  brings  about  the  desired  effect,  e.g.,  cell  death. 
Consequently,  when  drug-resistant  tumor  cells,  selected  by  a 
drug  treatment,  present  themselves  as  a  cancer  recurrence,  the 
necessitv  of  employing  another  therapeutic  agent  (if  such  an 
option  exists)  robs  the  physician  of  whatever  cell-selectivity 
advantage  there  was  with  the  earlier  drug. 

The  situation  is  qualitatively  different  with  IC-based  com¬ 
pounds.  Suppose  that  a  treatment  that  included  the  drug  abi 
(Figs.  1  and  2)  results  in  the  appearance  of  abi-resistant  tumor 
cells  that  contain,  for  example,  an  altered  or  overproduced 
enzyme  I.  If  so,  replacement  of  the  i  moiety  by  another  small 
cytotoxic  moiety,  specific  for  another  essential  enzyme,  would 
retain  the  cell  selectivity  of  the  new  ab-containing  drug.  Thus, 


one  advantage  of  modularity  inherent  in  the  designs  of  IC/ 
antiinhibitor-based  compounds  (Figs.  1-4)  lies  in  the  separa¬ 
tion  of  the  effector  aspect  of  a  drug  from  its  selectivity  aspect. 
As  a  result,  once  an  efficacious  arrangement  of  the  selectivity 
modules  in  abi-  or  ci*-typc  compounds  has  been  identified,  it 
can  be  reutilized  in  drugs  bearing  effector  moieties  other  than 
i  and  i*. 

Concluding  Remarks.  The  above  considerations  are  based 
on  the  existing  understanding  of  single-target  drugs  and  on  the 
notion  of  steric  hindrance.  By  introducing  the  new  concept  of 
antieffectors  and  a  modification  of  the  previously  proposed 
idea  of  codominant  interference  (14).  we  can  now  attempt  the 
construction  of  small  modular  compounds  that  possess  a 
multitargct,  combinatorial  selectivity  (Fig.  4).  The  IC/ 
antieffec'tor  strategies  are  not  confined  to  cytotoxic  therapies 
and  are  relevant,  in  principle,  to  all  pharmacological  settings. 
As  indicated  above,  one  expected  difficulty  in  implementing 
these  strategies  stems  from  significant  pharmacokinetic  con¬ 
straints  that  do  not  necessarily  arise  with  single-target  drugs. 

This  work  was  motivated  by  the  premise  that  the  confine¬ 
ment  of  anticancer  drug  research  and  development  to  single¬ 
target  compounds  will  prove  insufficient  for  the  task  at  hand, 
because  even  the  informed  choices  of  targets  for  such  drugs 
may  not  define  unambiguously  enough  the  cell  type  to  be 
eliminated.  The  remedy,  described  above,  is  to  aim  for  drugs 
that  possess  qualitatively  different  selectivity— multitarget  and 
combinatorial.  If  this  view  is  correct,  the  future  ascent  of 
multitarget  drugs  may  transform  not  only  the  treatment  of 
cancer  but  also  approaches  in  other  settings  where  the  killing 
or  modification  of  undesirable  cells  or  organelles  is  carried  out 
in  the  presence  of  nearly  identical  cells  or  organelles  that  must 
be  spared.  These  applications  of  multitarget  drugs  encompass 
more  discriminating  antiviral  and  antifungal  therapies,  as  well 
as  the  selective  killing  of  activated  lymphocytes  in  autoimmune 
diseases  and  the  selective  elimination  of  damaged  mitochon¬ 
dria  in  aging  cells  (14, 15).  In  yet  another  class  of  applications, 
a  noncytotoxic  multitarget  drug  would  be  used  to  inhibit  a 
clinically  relevant  nonessential  enzyme  in  a  specific  subset  of 
the  enzyme-containing  cells,  thereby  retaining  the  benefits  of 
inhibition  while  reducing  its  side  effects. 
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ABSTRACT  The  N-end  rule  relates  the  in  vivo  half-life  of  a 
protein  to  the  identity  of  its  N-terminal  residue.  The  N-end  rule 
pathway  is  one  proteolytic  pathway  of  the  ubiquitin  system.  The 
recognition  component  of  this  pathway,  called  N-recognin  or  E3, 
binds  to  a  destabilizing  N-terminal  residue  of  a  substrate  protein 
and  participates  in  the  formation  of  a  substrate-linked  multiu- 
biquitin  chain.  We  report  the  cloning  of  the  mouse  and  human 
Ubrl  cDNAs  and  genes  that  encode  a  mammalian  N-recognin 
called  E3a.  Mouse  UBRlp  (E3a)  is  a  1,757-residue  (200-kDa) 
protein  that  contains  regions  of  sequence  similarity  to  the 
225-kDa  Ubrlp  of  the  yeast  Saccharomyces  cerevisiae.  Mouse  and 
human  UBRlp  have  apparent  homologs  in  other  eukaryotes  as 
well,  thus  defining  a  distinct  family  of  proteins,  the  UBR  family. 
The  residues  essential  for  substrate  recognition  by  the  yeast 
Ubrlp  are  conserved  in  the  mouse  UBRlp.  The  regions  of 
similarity  among  the  UBR  family  members  include  a  putative 
zinc  finger  and  RING-H2  finger,  another  zinc-binding  domain. 
Ubrl  is  located  in  the  middle  of  mouse  chromosome  2  and  in  the 
syntenic  15ql5-q21.1  region  of  human  chromosome  15.  Mouse 
Ubrl  spans  *='120  kilobases  of  genomic  DNA  and  contains  ^50 
exons.  Ubrl  is  ubiquitously  expressed  in  adults,  with  skeletal 
muscle  and  heart  being  the  sites  of  highest  expression.  In  mouse 
embryos,  the  Ubrl  expression  is  highest  in  the  branchial  arches 
and  in  the  tail  and  limb  buds.  The  cloning  of  Ubrl  makes  possible 
the  construction  of  U^ri-lacking  mouse  strains,  a  prerequisite 
for  the  functional  understanding  of  the  mammalian  N-end  rule 
pathway. 


A  number  of  regulatory  circuits  involve  metabolically  unstable 
proteins.  Short />z  vivo  half-lives  are  also  characteristic  of  damaged 
or  otherwise  abnormal  proteins  (1-4-).  Features  of  proteins  that 
confer  metabolic  instability  are  called  degradation  signals,  or 
degrons.  The  essential  component  of  one  degradation  signal, 
called  the  N-degron,  is  a  destabilizing  N-terminal  residue  of  a 
protein  (5, 6).  The  set  of  amino  acid  residues  that  are  destabilizing 
in  a  given  cell  type  yields  a  rule,  called  the  N-end  rule,  which 
relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its 
N-terminal  residue.  Similar,  but  distinct,  versions  of  the  N-end 
rule  pathway  are  present  in  all  organisms  examined,  from  mam¬ 
mals  to  fungi  and  bacteria  (6-8). 

In  eukaryotes,  the  N-degron  comprises  two  determinants:  a 
destabilizing  N-terminal  residue  and  an  internal  lysine  or  lysines 
(8).  The  Lys  residue  is  the  site  of  formation  of  a  multiubiquitin 
chain  (9).  The  N-end  rule  pathway  is  thus  one  pathway  of  the 
ubiquitin  (Ub)  system.  Ub  is  a  76-residue  protein  whose  covalent 
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conjugation  to  other  proteins  plays  a  role  in  a  multitude  of 
processes,  including  cell  growth,  division,  differentiation,  and 
responses  to  stress  (1, 3, 4, 10).  In  most  of  these  processes,  Ub  acts 
through  routes  that  involve  the  degradation  of  Ub-protein  con¬ 
jugates  by  the  26S  proteasome,  an  ATP-dependent  multisubunit 
protease  (11). 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast  Sac¬ 
charomyces  cerevisiae,  Asn  and  Gin  are  tertiary  destabilizing 
N-terminal  residues  in  that  they  function  through  their  enzymatic 
deamidation  into  the  secondary  destabilizing  N-terminal  residues 
Asp  and  Glu  (12).  The  destabilizing  activity  of  N-terminal  Asp 
and  Glu  requires  their  enzymatic  conjugation  to  Arg,  one  of  the 
primary  destabilizing  residues  (6).  The  primary  destabilizing 
N-terminal  residues  are  bound  directly  by  the  UBRi -encoded 
N-recognin  (also  called  E3),  the  recognition  component  of  the 
N-end  rule  pathway  (13).  In  5.  cerevisiae,  N-recognin  is  a  225-kDa 
protein  that  binds  to  potential  N-end  rule  substrates  through  their 
primary  destabilizing  N-terminal  residues — Phe,  Leu,  Trp,  Tyr, 
lie,  Arg,  Lys,  and  His.  N-recognin  has  at  least  two  substrate¬ 
binding  sites.  The  type  1  site  is  specific  for  the  basic  N-terminal 
residues  Arg,  Lys,  and  His.  The  type  2  site  is  specific  for  the  bulky 
hydrophobic  N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  and  lie  (6). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  5.  cerevisiae  (through  degradation  of 
Cup9p,  a  transcriptional  repressor  of  the  peptide  transporter 
Ptr2p);  a  role  in  controlling  the  Slnlp-dependent  phosphoryla¬ 
tion  cascade  that  mediates  osmoregulation  in  S.  cerevisiae',  the 
degradation  of  Gpalp,  a  Ga  protein  of  5.  cerevisiae;  and  the 
degradation  of  alphaviral  RNA  polymerases  in  virus-infected 
metazoan  cells  (6,  14). 

The  mammalian  counterpart  of  the  yeast  UBRl -encoded  N- 
recognin  (E3)  was  characterized  biochemically  in  extracts  from 
rabbit  reticulocytes  (15-17).  Rabbit  E3a  was  shown  to  be  spe- 


Abbreviations:  Ub,  ubiquitin;  kb,  kilobase;  id.,  identity;  si.,  similarity; 
BAG,  bacterial  artificial  chromosome;  FISH,  fluorescence  in  situ 
hybridization;  cn,  embryonic  day. 

Data  deposition:  Nucleotide  sequences  reported  in  this  work  have 
been  deposited  in  the  GenBank  database  [accession  nos.  AF061555 
(mouse  Ubrl  cDNA)  and  AF061556  (human  UBRl  cDNA)]. 

*‘‘To  whom  reprint  requests  should  be  addressed  at:  Division  of 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena, 
CA  91125.  e-mail:  avarsh@cco.caItcch.edu. 

7tThe  names  of  mouse  genes  are  in  italics,  with  the  first  letter 
uppercase.  The  names  of  human  and  S.  cerevisiae  genes  are  also  in 
italics,  all  uppercase.  If  human  and  mouse  genes  are  named  in  the 
same  sentence,  the  mouse  gene  notation  is  used.  The  names  of  S. 
cerevisiae  proteins  are  Roman,  with  the  first  letter  uppercase  and  an 
extra  lowercase  “p"  at  the  end.  The  names  of  the  corresponding 
mouse  and  human  proteins  are  the  same,  except  that  all  letters  but 
the  last  “p”  are  uppercase.  The  latter  usage  is  a  modification  of  the 
existing  convention  (33).  to  facilitate  simultaneous  discussions  of 
yeast,  mouse,  and  human  proteins.  In  some  citations,  the  abbrevi¬ 
ated  name  of  a  species  precedes  the  gene's  name. 
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Fig.  1.  Peptides  of  rabbit  UBRlp  (E3a)  and  isola¬ 
tion  of  the  mouse  Ubrl  cDNA.  (A)  Amino  acid  se¬ 
quences  of  tryptic  peptides  of  the  purified  rabbit 
UBRlp  (see  Materials  and  Methods).  The  alternative 
sets  of  peptide  names,  T-based  and  PEP1-PEP3  (in 
parentheses),  refer  to  two  different  preparations  of 
E3a.  The  sequences  of  T120,  T76,  T96,  and  T122  that 
were  encoded  DNA  sequences  identified  through 
intrapeptide  PCR  are  underlined.  Residues  deduced 
from  the  mouse  Ubrl  cDNA  that  differed  from  those 
inferred  through  peptide  sequencing  are  indicated  in  a 
smaller  font.  The  peptides’  positions  in  the  deduced 
sequence  of  mouse  UBRlp  are  indicated.  {B)  The 
intrapeptide/interpeptide-PCR  cloning  strategy.  The 
products  of  the  initial  intrapeptide  PCR,  derived  from 
rabbit  genomic  DNA,  were  used  to  carry  out  interpep¬ 
tide  PCRwith  a  rabbit  liver  cDNA  library  (CLON- 
TECH).  The  resulting  392-bp  fragment  of 
the  rabbit  Ubrl  cDNA  was  used  to  isolate,  using  PCR  and  a  Agtll  mouse  liver  cDNA  library,  the  corresponding  392-bp  mouse  Ubrl  cDNA  fragment. 
This  fragment  then  was  used  to  screen  the  same  cDNA  library,  yielding  a  2.4-kb  fragment  of  the  mouse  Ubrl  cDNA  that  encoded  several  of  the 
peptide-derived  sequences  of  the  rabbit  UBRlp.  The  encoded  sequence  was  also  significantly  similar  to  that  of  the  N-terminal  region  of  S,  cerevisiae  Ubrlp 
(13)  and  contained  the  putative  start  (ATG)  codon  of  the  mouse  Ubrl  ORF.  To  isolate  the  rest  of  the  5'  region  of  the  Ubrl  cDNA,  5 '-rapid  amplification 
of  cDNA  ends  (RACE)-PCR  (20)  was  performed  with  poly(A)-^  RNA  from  mouse  L  ceils  and  a  primer  from  the  2.4-kb  DNA  fragment.  3'-RACE-PCR 
(20)  was  used  to  amplify  a  downstream  region  of  Ubrl  cDNA.  The  resulting  DNA  fragment  (nucleotides  2,470-3,467)  then  was  used  to  screen  a  AgtlO 
mouse  cDNA  library  from  MEL-C19  cells.  Five  overlapping  cDNA  isolates  (MR16,  MR17,  MR19,  MR20,  and  MR23)  that  together  spanned  the  entire 
Ubrl  cDNA  were  mapped  and  subcloned  into  Bluescript  II SK+  (Stratagene),  gelding  the  plasmid  MR26,  which  contained  the  entire  ORF  of  Ubrl.  The 
ORF  region  of  Ubrl  cDNA  was  sequenced  on  both  strands  at  least  twice,  using  independently  derived  cDNA  clones. 
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cifically  required  for  the  Ub-dependent  degradation  of  proteins 
bearing  either  type  1  (basic)  or  type  2  (bulky  hydrophobic) 
destabilizing  N-terminal  residues  (7,  15,  16). 

We  began  dissection  of  the  mouse  N-end  rule  pathway  by 
isolating  the  Ntanl  gene,  which  encodes  the  asparagine-specific 
N-terminal  amidase  (18,  19),  a  component  of  the  mammalian 
N-end  rule  pathway,  and  by  constructing  mouse  strains  that  lack 
Ntanl  (Y.T.K.  and  A.V.,  unpublished  data).  Herein,  we  describe 
the  cloning  and  characterization  of  the  mouse  and  human  cDNAs 
and  genestt  that  encode  UBRlp  (E3q:),  a  homolog  of  yeast 
Ubrlp  and  the  main  recognition  component  of  the  N-end  rule 
pathway. 

MATERIALS  AND  METHODS 

Isolation  and  Partial  Sequencing  of  Mammalian  £3a 
(UBRlp).  Rabbit  E3a  was  purified  from  reticulocyte  extracts  by 
using  affinity  chromatography  with  immobilized  protein  sub¬ 
strates  of  UBRlp  and  elution  with  dipeptides  bearing  destabi¬ 
lizing  N-terminal  residues  (16).  The  resulting  preparation  was 


fractionated  by  SDS/PAGE.  The  band  of  «=^180-kDa  E3a  was 
excised  and  subjected  to  digestion  with  trypsin.  Amino  acid 
sequences  were  determined  for  14  peptides  of  rabbit  UBRlp  (Fig. 
L4)  by  using  standard  methods  (20). 

Isolation  of  the  Full-Length  Mouse  Ubrl  cDNA  A  strategy  that 
included  the  intrapeptide-interpeptide  PCR  (21)  was  used  (see 
the  legend  to  Fig.  1). 

Isolation  of  a  Partial  Human  UBRI  cDNA  Poly(A)^  RNA 
from  human  293  ceils  was  subjected  to  reverse  transcription- 
PCR,  using  sets  of  primers  corresponding  to  sequences  of  the 
mouse  Ubrl  cDNA.  One  of  the  reactions  yielded  a  1.0-kilobase 
(kb)  fragment  that  encompassed  a  region  of  the  human  UBRI 
cDNA  (Fig.  2). 

Mouse  and  Human  Genomic  Ubrl  Fragments.  A  library  of 
mouse  genomic  DNA  fragments  (strain  SvJ)  in  bacterial  artificial 
chromosome  (BAC)  (22)  vector  (Genome  Systems,  St.  Louis) 
was  used,  as  described  in  the  legend  to  Fig.  2. 

Northern,  Southern,  and  Whole-Mount  in  Situ  Hybrid¬ 
izations.  Mouse  and  human  multiple-tissue  Northern  blots 
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diagram  is  a  ^21-kb  fragment  of  the  human  UBRI  gene,  corresponding  to  1.0  kb  of  the  indicated  region  of  the  human  UBRI  cDNA  (nucleotides 
2,218-3,227  of  the  mouse  Ubrl  cDNA  sequence).  The  mouse  and  human  Ubrl  exons  are  denoted,  respectively,  by  numbers  and  letters.  Aso  indicated 
are  the  exon  locations  of  some  of  the  type  1  and  type  2  substrate-binding  sites  of  N-recognin  (the  essential  amino  acid  residues  are  underlined)  (A.  Webster, 
M.  Ghislain,  and  A.V.,  unpublished  data;  see  the  main  text).  Not  shown  are  the  114-bp  5 '-untranslated  region  (UTR)  and  the  1,010  bp  3'-UTR  of  the 
mouse  Ubrl  cDNA.  To  isolate  mouse  Ubrl,  a  library  of  mouse  genomic  DNA  fragments  in  a  BAC  vector  (see  Materials  and  Methods)  was  screened  with 
a  fragment  of  the  mouse  Ubrl  cDNA  (nucleotides  105-1,333)  as  a  probe,  yielding  seven  BAC  clones,  of  which  BAC3  and  BAC4  contained  the  entire 
Ubrl  gene.  The  exon/intron  organization  of  the  first  31  kb  (--1  /4)  of  the  mouse  Ubrl  gene  was  determined  by  using  exon-specific  PCR  primers  to  produce 
=^40  genomic  DNA  fragments  of  the  BAC3  insert  that  ranged  in  size  from  1.3  to  18  kb.  Regions  encompassing  the  exon/intron  junctions  then  were 
sequenced  by  using  intron-specific  primers.  Fragments  of  the  human  genomic  UBRI  DNA  were  isolated  by  using  primers  derived  from  the  1.0-kb  fragment 
of  the  human  UBRI  cDNA  the  Expand  High  Fidelity  PCR  System  (Roche  Molecular  Biochemicals,  Indianapolis,  IN),  and  genomic  DNA  from  human 
293  cells.  The  resulting  four  fragments  were  subcloned  into  pCR2.1  (Invitrogen),  yielding  the  plasmids  HR8,  HR6-4,  HR2-25,  and  HR7-2,  whose  partially 
overlapping  inserts  encompassed  ^21  kb  of  the  human  UBRI  gene.  Partial  sequencing  of  the  mouse  and  human  genomic  Ubrl  fragments  (*-20  kb  of 
sequenced  DNA)  included  all  of  the  exon/intron  junctions  in  these  regions  of  Ubrl. 
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Fig.  3.  Comparison  of  the  deduced  amino  acid  sequence  of  mouse  UBRlp  (Mm-UBRl)  with  those  of  C  elegans  UBRlp  (Ce-UBRl),  5. 
cerevisiae  Ubrlp  (Sc-UBRl),  and  K.  lactis  Ubrlp  (Kl-UBRl).  White-on-black  and  gray  shadings  highlight,  respectively,  identical  and  similar  residues. 
The  residues  of  UBR  proteins  that  are  identical  to  those  of  S.  cerevisiae  Ubrlp  are  denoted  by  double  dots,  at  positions  where  the  identity  involves 
just  one  nor[-cerevisiae  protein.  Also  indicated  are  the  regions  of  significant  similarity  among  the  four  proteins.  K.  lactis  UBRl  was  cloned  through 
its  crosshybridization  to  5.  cerevisiae  UBRl  (P.  Waller  and  A.V.,  unpublished  data). 


(CLONTECH),  and  either  mouse  or  human  Ubrl  cDNA 
fragments  labeled  with  were  used  (20).  Southern  hybrid¬ 
izations  were  carried  out  by  using  standard  techniques  (20). 
Mouse  embryos  were  staged,  fixed,  and  processed  for  in  situ 
hybridization  as  described  (23).  For  sectioning,  the  stained 
embryos  were  embedded  in  OCT  medium  (Sakura  Finetek, 
Torrance,  CA).  A  1.2-kb  Ubrl  cDNA  fragment  (nucleotides 


3,150-3,355)  was  used  as  a  template  for  synthesizing  anti- 
sense-  or  sense-strand  RNA  probes  labeled  with  digoxigenin 
(23). 

Chromosome  Mapping  of  the  Mouse  and  Human  Ubrl.  The 

mapping  of  mouse  Ubrl  was  carried  out  by  using  the  interspecific 
backcross  analysis  (24),  essentially  as  described  (18).  Human 
UBRl  was  mapped  by  using  fluorescence  in  situ  hybridization 
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Fig.  4.  Two  Cys/His  domains  of  the 
UBR  protein  family.  Comparison  of  the 
putative  zinc  finger  (region  I)  and 
RING-H2  finger  (region  IV)  with  the 
corresponding  sequences  from  the  other 
species  in  Fig.  3,  and  also  with  C.  albicans 
Ubrlp  (Ca-UBRl),  C  elegans  UBR2p 
(Ce-UBR2),  and  5.  cerevisiae  Ubr2p  (Sc- 
UBR2).  Numbers  indicate  the  lengths  of 
gaps.  The  conserved  Cys  and  His  residues 
are  indicated. 


(FISH)  with  mitotic  chromosomes  from  human  lymphocytes 
(25).  The  probe  was  a  mixture  of  the  HRS,  HR6-4,  HR2-25,  and 
HR7-2  plasmids,  labeled  with  biotin  using  biotinylated  dATP  and 
the  BioNick  labeling  kit  (Life  Technologies,  Grand  Island,  NY), 
and  detected  by  using  fluorescein  isothiocyanate-avidin. 

RESULTS  AND  DISCUSSION 

Isolation  of  the  Mouse  Ubrl  cDNA  Tryptic  peptides  of  the 
purified  rabbit  E3a  (UBRlp)  (16)  were  isolated  and  sequenced 
{stQ Materials  andMethods\y\t\^g  14  short  regions  of  E3a  (Fig. 
\A),  These  regions  lacked  significant  similarities  to  the  deduced 
sequence  of  S.  cerevisiae  Ubrlp  (13).  We  used  intrapeptide  PCR 
(21)  to  identify  a  unique  (nondegenerate)  sequence  of  the  rabbit 
Ubrl  cDNA.  This  method  allows  amplification  of  a  short  unique 
DNA  sequence  by  using  two  degenerate  PCR  primers  (derived  by 
reverse  translation)  that  flank  this  sequence  and  correspond  to 
the  outermost  regions  of  a  single  peptide  (Fig.  LB).  Several 
intrapeptide  nucleotide  sequences  were  obtained  this  way  (Fig. 
1).  These  sequences,  together  with  those  of  the  original  degen¬ 
erate  primers,  then  were  used  to  amplify  a  392-bp  fragment  of  the 
rabbit  Ubrl  cDNA,  using  interpeptide  PCR  (Fig.  1).  This  frag¬ 
ment  encoded  peptides  T120  and  T134  at  either  end  and  peptide 
TlOO  in  the  middle  (Fig.  \B),  A  homologous  392-bp  fragment  of 
the  mouse  Ubrl  cDNA  then  was  amplified  by  using  the  same 
method  (Fig.  IB).  The  rabbit  and  mouse  392-bp  Ubrl  cDNA 
fragments  were  88%  and  89%  identical  at  the  nucleotide  and 
amino  acid  sequence  levels,  respectively,  but  lacked  significant 
similarities  to  5.  cerevisiae  UBRl  (data  not  shown). 

The  392-bp  mouse  Ubrl  cDNA  fragment  then  was  used,  in 
conjunction  with  standard  cDNA  library  screening  and  rapid 
amplification  of  cDNA  ends-PCR  (20),  to  isolate  multiple  Ubrl 
cDNA  fragments,  and  to  assemble  them  into  a  5,271-bp  ORF 
encoding  a  1,757-residue  protein  (pi  of  6.0),  whose  size,  200  kDa, 
was  close  to  the  estimated  size  of  the  isolated  rabbit  UBRlp 
(E3a),  «=*180  kDa  (16)  (Figs.  2  and  3).  The  inferred  ATG  start 
codon  (Fig.  2),  within  the  sequence  CTTAAGATGGCG,  is 
preceded  by  two  in-frame  stop  codons,  at  positions  -48  and  -93, 
and  is  located  in  a  favorable  Kozak  context  (26),  with  A  and  G 
at  positions  -3  and  +4,  respectively.  There  are  two  more  ATGs, 
five  and  11  codons  downstream  of  the  inferred  one.  These 
alternative  start  codons  are  in  a  favorable  Kozak  context  as  well. 

Cloning  and  Partial  Characterization  of  the  Mouse  and  Hu¬ 
man  Ubrl  Genes.  A  fragment  of  the  mouse  Ubrl  cDNA  was  used 
to  isolate  a  '«120-kb  mouse  Ubrl  genomic  DNA  clone,  carried  in 
a  BAC  vector  (22).  We  determined  the  exon/intron  organization 
and  restriction  map  of  the  ^^31-kb  region  of  Ubrl  that  corre¬ 
sponded  to  the  1,340-bp  5 '-region  of  the  mouse  Ubrl  cDNA 
(nucleotides  105-1,333)  (Fig.  2).  The  lengths  of  the  12  exons  in 
this  region  of  mouse  Ubrl  range  from  63  to  257  bp  (Fig.  2). 

The  nucleotide  and  deduced  amino  acid  sequences  of  the 
1.0-kb  human  UBRl  cDNA  fragment  (see  Materials  and  Meth¬ 
ods),  located  approximately  in  the  middle  of  UBRl  cDNA 
(nucleotides  2,218-3,227  of  the  mouse  Ubrl  cDNA)  (Fig.  2), 
were,  respectively,  91%  and  94%  identical  to  the  corresponding 
mouse  Ubrl  cDNA  and  UBRlp  sequences.  Overlapping  genomic 


DNA  fragments  of  human  UBRl  that,  together,  encompassed  a 
'^21-kb  region  of  the  human  UBRl  gene  and  corresponded  to  the 
1.0-kb  fragment  of  the  human  UBRl  cDNA  (Fig.  2),  were  isolated 
from  human  DNA  by  using  cDNA-derived  primers  and  PCR. 
Partial  sequencing  showed  that  this  «='21-kb  region  of  human 
UBRl  contained  1 1  exons  whose  length  ranged  from  49  to  155  bp, 
a  distribution  of  exon  lengths  similar  to  that  in  a  different  region 
of  mouse  Ubrl  (Fig.  2).  All  of  the  sequenced  exon/intron 
junctions  («23  exons),  which  encompassed  a  «^52-kb  region  of 
the  mouse  and  human  Ubrl,  contained  the  consensus  GT  and  AG 
dinucleotides  characteristic  of  the  mammalian  nuclear  pre- 
mRNA  splice  sites  (data  not  shown)  (20).  Extrapolating  from 
these  data  on  the  mouse  and  human  Ubrl  genes  and  the  corre¬ 
sponding  regions  of  their  cDNAs  (Fig.  2),  a  mammalian  Ubrl 
gene  is  expected  to  be  ^120  kb  long  and  to  contain  «50  exons. 

The  Mouse  UBRlp  Protein  and  its  Homologs.  The  low  overall 
sequence  similarity  of  mouse  UBRlp  (E3a)  to  Ubrlp  of  either  S. 
cerevisiae  [22%  identity  (id.),  48%  similarity  (si.)]  or  another 
budding  yeast,  Kluyverorrryces  lactis  (21%  id.,  48%  si.),  belied  the 
presence  of  five  regions,  denoted  I-V,  which  were  significantly 
similar  between  the  mouse  and  yeast  versions  of  UBRlp  (Figs.  3 
and  4).  By  contrast,  the  Ntanl -encoded  asparagine-spedfic  N- 
terminal  amidase,  the  most  upstream  component  of  the  mouse 
N-end  rule  pathway,  lacks  sequence  similarities  to  its  S.  cerevisiae 
counterpart  Ntalp  (12,  18).  Database  searches  identified  other 
likely  homologs  of  mouse  UBRlp,  in  particular  the  1,927-residue 
protein  of  the  nematode  Caenorhabditis  elegans  (GenBank  ac¬ 
cession  no.  U88308)  (32%  id.,  53%  si.;  termed  Ce-Ubrl);  the 
1,872-residue  S.  cerevisiae  protein  (GenBank  accession  no. 
Z73196)  (21%  id.,  47%  si.;  termed  Sc-UBR2;  ref.  4);  the  2,168- 
residue  C  elegans  protein  (GenBank  accession  no.  U40029)  (21% 
id.,  45%  si.;  termed  Ce-Ubrl);  and  the  794-residue  ^R3p 
protein  of  the  iplzntArabidopsis  thaliana  (GenBank  accession  no. 
X95962)  (26%  id.,  49%  si.).  CER3p  is  involved  in  wax  biosyn¬ 
thesis  in^.  thaliana  (27).  In  addition,  a  147-residue  sequence  of 
the  yeast  Candida  albicans  (http://alces.med.umn.edu/bin/ 
genelist?LUBRl)  was  similar  to  the  N-terminal  region  of  mouse 
UBRlp  (Fig.  4). 

The  presence  of  high-similarity  regions  I-V  among  these 
deduced  sequences  (Figs.  3  and  4)  suggested  the  existence  of  a 
distinct  protein  family,  termed  UBR.  The  66-residue  region  I, 
near  the  N  terminus  of  UBRlp,  is  a  particularly  clear  UBR 
family-identifying  region  (e.g.,  61%  id,,  75%  si.  between  mouse 
and  C.  elegans  UBRlp)  (Figs.  3  and  4). 

Recent  genetic  an^yses  of  5.  cerevisiae  Ubrlp  (N-recognin) 
have  shown  that  the  regions  I-III  contain  residues  essential  for 
the  recognition  of  N-end  rule  substrates  by  Ubrlp.  In  particular, 
Cys-145,  Val-146,  Gly-173,  and  Asp-176  of  region  I  were  identi¬ 
fied  as  essential  residues  of  the  type  1  binding  site  of  S.  cerevisiae 
Ubrlp  (A.  Webster,  M.  Ghislain,  and  AV.,  unpublished  data). 
All  four  of  these  residues  were  conserved  between  the  yeast, 
mouse,  and  C.  elegans  UBRlp  (Figs.  3  and  4).  Region  I  is  present 
in  all  of  the  known  UBR  family  members  except  CER3p  of  .4. 
thaliana,  which  contains  only  regions  IV  and  V  (Fig.  4).  Region 
I  encompasses  a  Cys/His-rich  domain,  Cys-Xi2-Cys-X2-Cys-X5- 
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Fig.  5.  Northern  and  in  situ  hybridizations  with  mouse  and  human  Ubrl.  (A)  Membranes  containing  electrophoretically  fractionated  poIy(A)‘^ 
mRNA  from  different  mouse  (a-c)  or  human  (d  and  e)  tissues  were  hybridized  with  either  a  2-kb  5 '-proximal  (nucleotides  116-2,124)  mouse  Ubrl 
cDNA  fragment  (a),  its  0.64-kb  3'-proximal  (nucleotides  4,749-5,388)  fragment  (6),  a  1-kb  human  UBRl  cDNA  fragment  {d%  or  the  human  p-actin 
cDNA  fragment  (c  and  e).  The  upper  arrows  in  a  and  d  indicate  the  «8-kb  Ubrl  transcript.  The  lower  arrow  in  a  indicates  the  «='6-kb  testis-specific 
Ubrl  transcript.  In  the  RNA  sample  from  mouse  spleen,  the  Ubrl  transcript  (but  not  the  actin  transcript)  may  have  been  degraded  (a-c).  (B) 
Expression  of  Ubrl  in  elO.5  and  el  1.5  mouse  embryos.  Mdiole-mount  in  situ  hybridization  was  carried  out  with  either  antisense  (AS)  or  sense  (S, 
negative  control)  Ubrl  cDNA  probes  (see  Materials  and  Methods).  The  regions  of  high  Ubrl  expression  are  indicated  by  arrows  (t,  tail;  fl,  forelimb 
buds;  hi,  hindlimb  buds).  The  branchial  arches,  where  Ubrl  is  also  highly  expressed  in  elO.5  embryos  (data  not  shown),  are  not  visible  in  this  elO.5 
embryo.  (C)  Expression  of  Ubrl  in  the  surface  ectoderm  of  limb  buds.  Shown  is  a  transverse  section  of  a  forelimb  bud  of  an  elO.5  embryo  (se, 
surface  ectoderm).  (D)  FISH  analysis  of  human  UBRL  (Upper)  An  example  of  the  f/BRi -specific  FISH  signal  (arrow).  (Lower)  The  same  mitotic 
spread  stained  with  4'-6-diamino-2-phenylindole  (DAPI)  to  visualize  the  chromosomes  (see  also  Fig.  6). 


Cys-X2-Cys-X2-Cys-X5-His-X2-His-X(i2-i4)‘Cys-Xi-Cys-Xii-Cys 
(Figs.  3  and  4),  which  is  distinct  from  the  known  consensus 
sequences  of  zinc  fingers  and  other  Cys/His-motifs.  Residues 
Asp-318,  His-321,  and  Glu-560  of  S.  cerevisiae  Ubrlp,  which  have 
been  identified  as  essential  for  the  type  2  binding  site  of  this 
N-recognin  (A.  Webster,  M.  Ghislain,  and  A.V,,  unpublished 
data),  were  found  to  be  retained  in  region  II  (Asp-318  and 
His-321)  and  region  III  (Glu-560)  of  the  mouse  and  C.  elegans 
UBRlp  (Fig.  3). 

Region  IV  contains  another  Cys/His-rich  domain  of  UBRlp, 
Cys-Xz-Cysdoop  i-Cys-XrHis-X2-His-X2-Cys-/oqp  2-Cys-X2-Cys 
(Figs.  3  and  4),  which  is  present  in  all  of  the  UBR  family  members, 
and  fits  the  consensus  sequence  of  the  RING-H2  finger,  a 
subfamily  of  the  previously  defined  RING  motif  (28).  At  least 
some  of  the  RING-H2  sequences  are  sites  of  specific  protein- 
protein  interactions  (28).  Apcllp,  a  subunit  of  the  Ub-protein 
ligase  complex  called  the  cyclosome  (2)  or  the  anaphase  promot¬ 
ing  complex,  also  contains  a  RING-H2  finger  (29). 

Another  area  of  similarity  (24-50%  id.,  46-70%  si.)  among  the 
UBR  family  members  is  region  V  (Fig.  3  and  data  not  shovm). 
This  region,  115  residues  long  in  mouse  UBRlp,  near  the 
protein’s  C  terminus,  is  particularly  similar  between  mouse  and  C. 
elegans  UBRlp  (50%  id.,  70%  si.)  (Fig.  3).  Region  V  is  located 
4-14  residues  from  the  UBR  proteins’  C  termini,  the  exceptions 
being  the  S.  cerevisiae  and  fC  lactis  Ubrlp,  which  bear,  respec¬ 
tively,  132-  and  159-residue  tails  of  unknown  function  that  are  rich 
in  the  acidic  Asp/Glu  residues  (36%  and  33%)  (Fig.  3).  No 
significant  similarities  could  be  detected  between  mammalian 
UBRlp  and  other  E3s  (recognins)  of  the  metazoan  Ub  system, 


including  E6AP  (30)  and  subunits  of  the  cyclosome/anaphase 
promoting  complex,  except  for  the  presence  of  a  RING-H2  finger 
domain  in  the  latter  (29).  [Different  E3  proteins  of  the  Ub  system 
recognize  different  degrons  in  protein  substrates,  thereby  defin¬ 
ing  distinct  Ub-dependent  proteolytic  pathways  (1,  4).] 

Expression  of  Mouse  and  Human  Ubrl,  The  5'-  and  3'- 
proximal  mouse  cDNA  probes  yielded  similar  results,  detecting  a 
single  ^S-kb  transcript  in  sever^  tissues  (Fig.  5Aa  and  Ab).  In  the 
testis,  however,  the  «=^8-kb  species  of  Ubrl  mRNA  was  a  minor 
one,  the  major  species  being  ^6  kb  (Fig.  5Aa).  The  levels  of  either 
mouse  or  human  Ubrl  mRNA  were  highest  in  skeletal  muscle  and 
heart  (Fig.  5^).  The  expression  of  mRNA  encoding  E2i4k,  one  of 
the  mouse  Ub-conjugating  (E2)  enzymes  and  a  likely  component 
of  the  mouse  N-end  rule  pathway  (6),  was  also  highest  in  skeletal 
muscle  and  heart  (18). 

The  distinct  Ubrl  mRNA  pattern  in  the  testis  (Fig.  5A)  was 
reminiscent  of  the  analogous  expression  pattern  of  Ntanl 
mRNA,  which  encodes  the  Asn-specific  N-terminal  amidase, 
another  component  of  the  mammalian  N-end  rule  pathway. 
Specifically,  the  size  of  the  major  species  of  Ntanl  mRNA  was 
«=^1.4  kb  in  all  of  the  examined  mouse  tissues  except  testis, 
where  the  major  species  was  ^1.1  kb  (18).  The  Ntanl 

transcript  recently  was  found  to  hybridize  only  to  the  3 '-half 
(exons  6-10  but  not  exons  1-5)  of  the  Ntanl  ORF  (Y.T.K.  and 
A.V.,  unpublished  data).  The  functional  significance  of  the 
testis-specific  Ubrl  and  Ntanl  expression  patterns  remains  to 
be  understood. 

We  used  whole-mount  in  situ  hybridization  to  examine  the 
expression  of  Ubrl  during  embryogenesis.  In  e9.5  (9.5  days  old) 
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Fig.  6.  Chromosomal  locations  of  the  mouse  and  human  Ubrl 
genes.  (A)  Mouse  Ubrl  was  mapped  to  the  middle  of  mouse  chromo¬ 
some  2  by  using  interspecific  (M.  musculus-M.  spretus)  backcross 
analysis  (18,  24).  Shown  are  the  segregation  patterns  of  mouse  Ubrl 
and  the  flanking  genes  in  66  backcross  animals  that  were  typed  for  all 
loci.  For  individual  pairs  of  loci,  more  than  66  animals  were  typed. 
Each  column  represents  the  chromosome  identified  in  the  backcross 
progeny  that  was  inherited  from  the  [M.  musculus  C57BL/6J  X  M. 
spretus]  Fi  parent.  Filled  and  empty  squares  represent,  respectively, 
C57BL/6J  and  M.  spretus  alleles.  The  numbers  of  offspring  that 
inherited  each  type  of  chromosome  2  are  listed  below  the  columns,  (B) 
A  partial  mouse  chromosome  2  linkage  map  (MMU2),  showing  Ubrl 
in  relation  to  the  linked  genes  Thbsl,  Ebp4.2,  and  B2m,  and  also,  on 
the  left,  the  corresponding  recombination  distances  between  the  loci, 
in  centimorgans,  and  the  map  locations,  in  parentheses.  (C)  A  partial 
human  chromosome  15  linkage  map  (HSA15).  Each  dot  on  the  right, 
in  the  15ql5-q21.1  region,  corresponds  to  the  actually  observed 
-specific  double-dot  FISH  signal  detected  on  human  chromo¬ 
some  15  (see  also  Fig.  SD). 

mouse  embryos,  the  expression  of  Ubrl  was  highest  in  the 
branchial  arches  and  in  the  buds  of  forelimbs  and  the  tail  (data 
not  shown).  In  elO.5  embryos,  the  expression  of  Ubrl  became 
high  in  the  hindlimb  buds  as  well  (Fig.  5B).  This  pattern  was 
maintained  in  the  limb  buds  of  el  1.5  embryos  (Fig.  5B).  High 
expression  of  Ubrl  in  the  limb  buds  was  confined  predominantly 
to  the  surface  ectoderm  (Fig.  5C).  This  piattem  of  Ubrl  expression 
in  embryos  (Fig.  5  B  and  C)  is  similar,  if  not  identical,  to  that  of 
Ntanl,  which  encodes  asparagine-specific  N-terminal  amidase 
(18)  (Y.T.K.  and  A.V.,  unpublished  data),  consistent  with 
UBRlp  and  NTANlp  being  components  of  the  same  pathway. 

The  enhanced  expression  of  Ubrl  in  the  embryonic  limb  buds 
(Fig.  5  B  and  C)  is  interesting  in  view  of  the  conjecture  that  the 
N-end  rule  pathway  might  be  required  for  limb  regeneration  in 
amphibians  (31).  The  injection  of  dipeptides  bearing  destabilizing 
N-terminal  residues  into  the  stumps  of  amputated  forelimbs  of 
the  newt  was  observed  to  delay  limb  regeneration,  whereas  the 
injection  of  dipeptides  bearing  stabilizing  N-terminal  residues  had 
no  effect  (31).  Rigorous  tests  of  this  and  other  suggested  functions 
of  the  metazoan  N-end  rule  pathway  (6)  will  require  mouse 
strains  that  lack  Ubrl. 

Chromosome  Mapping  of  Mouse  and  Human  Ubrl,  The 

chromosomal  location  of  mouse  Ubrl  was  determined  by  inter¬ 
specific  backcross  analysis,  using  DNA  derived  from  matings  of 
[(C57BL/6J  X  Mas  spretus)¥ I  X  C57BL/6J]  mice  (Fig.  6  A  and 
B)  (18,  24).  Mouse  Ubrl  is  located  in  the  central  region  of 
chromosome  2  and  is  linked  to  the  Thbsl,Epb4.2,  and52m  genes, 
the  most  likely  gene  order  being  CQntxomQxc^’Thbsl‘Ubrl-Epb4.2- 
B2m  (Fig.  6B  and  data  not  shown). 

The  chromosomal  location  of  human  UBRl  was  determined 
by  using  FISH  (25),  with  human  UBRl  genomic  DNA  frag¬ 
ments  as  probes  (Figs.  5Z>  and  6C).  This  mapping  placed  UBRl 
at  the  15ql5-15q21.1  region  of  the  human  chromosome  15,  an 
area  syntenic  with  the  independently  mapped  position  of 
mouse  Ubrl  (Fig.  6).  Ubrl  is  located  in  the  regions  of  human 
chromosome  15  and  mouse  chromosome  2  that  appear  to  be 
devoid  of  the  previously  mapped  but  uncloned  mutations. 


Mutations  in  the  human  gene  CANP3.,  which  encodes  a  subunit 
of  calpain  and  is  located  very  close,  if  not  adjacent,  to  UBRl, 
have  been  shown  to  cause  a  myopathy  called  the  limb-girdle 
muscular  distrophy  (32). 

Concluding  Remarks.  Isolation  of  the  mouse  and  human  Ubrl 
cDNAs  and  genes  (Figs.  2-6)  should  enable  functional  under¬ 
standing  of  the  mammalian  N-end  rule  pathway,  in  part  through 
the  construction  and  analysis  of  mouse  strains  that  lack  Ubrl. 
Recent  searches  in  GenBank  identified  several  mouse  and  human 
sequences  in  expressed  sequence  tag  databases  that  exhibited 
significant  similarity  to  the  C-terminal  region  of  mouse  UBRlp. 
The  cloning  and  characterization  of  the  corresponding  cDNAs 
have  shown  that  there  exist  at  least  two  distinct  mouse  (and 
human)  genes,  termed  Ubr2  and  Ubr3,  which  encode  proteins  that 
are  significantly  similar  to  mouse  UBRlp  (Y.T.K.  and  A.V., 
unpublished  data).  Molecular  and  functional  analyses  of  these 
Ubrl  homologs  are  under  way. 
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Ubiquitin-dependent  proteolytic  systems  underlie 
many  processes,  including  the  cell  cycle,  cell  differenti¬ 
ation  and  responses  to  stress.  One  such  system  is  the 
N-end  rule  pathway,  which  targets  proteins  bearing 
destabilizing  N-terminal  residues.  Here  we  report  that 
Ubrlp,  the  main  recognition  component  of  this  path¬ 
way,  regulates  peptide  import  in  the  yeast  Saccharo- 
myces  cerevisiae  through  degradation  of  Cup9p,  a  35 
kDa  homeodomain  protein.  Cup9p  was  identified  using 
a  screen  for  mutants  that  bypass  the  previously 
observed  requirement  for  Ubrlp  in  peptide  import. 
We  show  that  Cup9p  is  a  short-lived  protein  (tiri 
-5  min)  whose  degradation  requires  Ubrlp.  Cup9p 
acts  as  a  repressor  of  PTR2,  a  gene  encoding  the 
transmembrane  peptide  transporter.  In  contrast  to 
engineered  N-end  rule  substrates,  which  are  recognized 
by  Ubrlp  through  their  destabilizing  N-terminal 
residues,  Cup9p  is  targeted  by  Ubrlp  through  an 
internal  degradation  signal.  The  Ubrlp-Cup9p-Ptr2p 
circuit  is  the  first  example  of  a  physiological  process 
controlled  by  the  N-end  rule  pathway.  An  earlier  study 
identified  Cup9p  as  a  protein  required  for  an  aspect 
of  resistance  to  copper  toxicity  in  S.cerevisiae.  Thus, 
one  physiological  substrate  of  the  N-end  rule  pathway 
functions  as  both  a  repressor  of  peptide  import  and  a 
regulator  of  copper  homeostasis. 

Keywords:  Cf/P9/N-end  rule/peptide  import/proteolysis/ 
PTR21UBR} 


Introduction 

Many  regulatory  proteins  are  short-lived  in  vivo  (Schwob 
et  al,  1994;  King  et  ai,  1996;  Varshavsky,  1996).  This 
metabolic  instability  makes  possible  rapid  adjustment 
of  the  protein’s  concentration  (or  subunit  composition) 
through  changes  in  the  rates  of  its  synthesis  or  degradation. 
Protein  degradation  plays  a  role  in  a  multitude  of  processes, 
including  cell  growth,  division,  differentiation  and 
responses  to  stress.  In  eukaryotes,  a  large  fraction  of 
intracellular  proteolysis  is  mediated  by  the  ubiquitin  sys¬ 
tem.  Ubiquitin  (Ub)  is  a  76-residue  protein  whose  covalent 
conjugation  to  other  proteins  marks  them  for  process! ve 
degradation  by  the  26S  proteasome — an  ATP-dependent, 
multisubunit  protease  (Jentsch  and  Schlenker,  1995;  Hilt 
and  Wolf,  1996:  Hochstrasser,  1996:  Rubin  et  ai,  1997). 


Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals  (degrons).  One  of  the 
degradation  signals  recognized  by  the  ubiquitin  system  is 
called  the  N-degron.  It  comprises  two  essential  determin¬ 
ants:  a  destabilizing  N-terminal  residue  and  an  internal 
lysine  of  a  substrate  (Bachmair  et  ai,  1986;  Varshavsky, 
1996).  The  Lys  residue  is  the  site  of  formation  of  a 
substrate-linked  multiubiquitin  chain  (Bachmair  and 
Varshavsky,  1989;  Chau  et  ai,  1989).  A  set  of  N-degrons 
bearing  different  N-terminal  residues  that  are  destabilizing 
in  a  given  cell  type  yields  a  rule,  called  the  N-end  rule, 
which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue.  Similar  but  distinct 
versions  of  the  N-end  rule  operate  in  all  organisms 
examined,  from  mammals  to  fungi  and  bacteria 
(Varshavsky,  1996). 

The  N-end  rule  pathway  is  organized  hierarchically.  In 
eukaryotes  such  as  Saccharomyces  cerevisiae,  Asn  and 
Gin  are  tertiary  destabilizing  N-terminal  residues  in  that 
they  function  through  their  conversion,  by  the  NTAl- 
encoded  N-terminal  amidase  (Nt-amidase),  into  the 
secondary  destabilizing  residues  Asp  and  Glu  (Baker  and 
Varshavsky,  1995).  Secondary  residues,  in  turn,  function 
through  their  conjugation  to  Arg  by  the  ATE/ -encoded 
Arg-tRNA-protein  transferase  (R-transferase)  (Balzi  etai, 
1990).  Arg  is  one  of  several  primary  destabilizing 
N-terminal  residues  which  are  bound  directly  by 
N-recognin,  a  225  kDa  E3  protein  encoded  by  the  UBRl 
gene  (Bartel  et  ai,  1990).  Ubrlp,  together  with  the 
associated  ubiquitin-conjugating  (E2)  enzyme  Ubc2p, 
mediates  the  formation  of  a  substrate- linked  multiubiquitin 
chain  (Varshavsky,  1996). 

The  N-end  rule  pathway  was  first  encountered  in  experi¬ 
ments  that  explored,  in  S. cerevisiae,  the  metabolic  fate  of 
a  fusion  between  Ub  and  a  reporter  such  as  Escherichia 
coli  p-galactosidase  (P-gal)  (Bachmair  et  ai,  1986).  WTiile 
such  engineered  N-end  rule  substrates  have  been  extens¬ 
ively  characterized  (Varshavsky,  1996),  little  is  known 
about  their  physiological  counterparts.  The  few  identified 
so  far  include  the  GEA 7-encoded  Ga  subunit  of  the 
S.  cerevisiae  heterotrimeric  G  protein,  which  mediates  the 
pheromone  response  in  this  fungus,  and  RNA  polymerases 
of  alphaviruses  whose  hosts  include  mammalian  and  insect 
cells  (de  Groot  et  ai,  1991;  Madura  and  Varshavsky, 

1994) .  Physiological  functions  of  the  instability  of  these 
proteins  remain  to  be  understood  (Varshavsky,  1996). 
Inactivation  of  the  N-end  rule  pathway  in  S. cerevisiae — 
through  deletion  of  the  UBRI  gene — results  in  cells  which 
grow  slightly  slower  than  their  wild-type  counterparts, 
and  are  impaired  in  sporulation  (increased  frequency  of 
asci  containing  fewer  than  four  spores),  but  otherwise 
appear  to  be  normal  (Bartel  et  ai,  1990). 

Recently,  Becker  and  colleagues  (Alagramam  et  ai, 

1995)  have  reported  that  iibrJA  cells  are  deficient  in  the 
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Fig.  1.  The  import  ot  peptides  is  decreased  in  the  absence  of  Ubc2p  and  virtually  abolished  in  the  absence  of  Ubc2p  and  Ubc4p.  Saccharomyces 
cerevisiae  mutants  deficient  in  one  or  more  ubiquitin-conjugating  (E2)  enzymes  or  other  components  of  the  ubiquitin  system  were  tested  for  their 
ability  to  import  peptides,  using  the  halo  assay  and  a  toxic  dipeptide  L-leucyl-L-ethionine  (Leu-Eth)  (see  Materials  and  methods).  ‘  +  \  ‘±’  and 
denote,  respectively,  the  apparently  wild-type,  significantly  reduced,  and  undetectable  levels  of  peptide  import.  A  superscript  ‘T  refers  to  strains  that 
carried  a  ts  allele  of  an  essential  E2  enzyme,  either  Cdc34p  (Ubc3p)  or  Ubc9p;  the  30®C  temperature  of  the  test  was  semi -permissive  for  these 
strains.  (A)  Summary  of  the  results.  (B-F)  Examples  of  the  actual  halo  assays,  with  wild-type  (B),  ptrZA  (C),  uhr/A  (D),  ubc2A  (E),  ubc4A  (F)  and 
ubc2A  iihc4A  (G)  strains  ot  S. cerevisiae.  Elimination  of  some  E2  enzymes  in  the  uhc2A  background  restored  halo  formation,  presumably  because 
such  strains  were  growth-impaired  in  a  way  that  made  them  hypersensitive  to  the  toxicity  of  ethionine.  By  contrast,  although  ubc2A  ubc4A  cells 
were  also  growth-impaired,  they  were  import-defective  and  therefore  grew  in  the  immediate  vicinity  of  the  filter. 


import  of  di-  and  tripeptides,  suggesting  that  this  process, 
which  is  universal  among  living  cells,  requires  the  N-end 
rule  pathway.  In  the  present  work,  we  identified  the 
underlying  regulatory  mechanism  and  discovered  a  new 
physiological  substrate  of  the  N-end  rule  pathway,  the 
homeodomain  protein  Cup9p.  This  short-lived  protein 
is  targeted  for  degradation  by  Ubrlp,  and  acts  as  a 
transcriptional  repressor  of  PTR2,  a  gene  that  encodes  a 
transmembrane  peptide  transporter.  The  Ubrlp-Cup9p- 
Ptr2p  circuit  is  the  first  example  of  a  physiological  process 
controlled  by  the  N-end  rule  pathway. 

Results 

The  invoivement  of  Ubc2p  and  Ubc4p  E2  enzymes 
in  the  control  of  peptide  import 

We  began  by  asking  whether  components  of  the  N-end 
rule  pathway  other  than  Ubrlp  were  also  necessary  for 
the  import  of  peptides.  Previous  work  has  shown  that 
Ubc2p,  one  of  13  ubiquitin-conjugating  (E2)  enzymes  of 
S.cerevisiae,  is  required  for  the  degradation  of  engineered 
N-end  rule  substrates,  and  is  physically  associated  with 
the  E3  protein  Ubrlp  (N-recognin)  (Jentsch,  1992:  Madura 
et  ai,  1993). 

To  test  for  the  ability  of  S. cerevisiae  to  import  peptides, 
we  used  a  halo  assay,  in  which  a  filter  soaked  in  the  toxic 
dipeptide  L-leucyl-L-ethionine  (Leu-Eth)  is  placed  on  a 
plate,  inhibiting  the  growth  of  import-competent  cells  near 
the  filter.  By  this  test,  the  elimination  of  UBC2  impaired, 
but  did  not  abolish,  the  import  of  peptides  (Figure  lA,  B 
and  E).  To  determine  which  of  the  other  E2  enzymes,  if 


any,  were  required  for  the  residual  peptide  import  observed 
in  ubc2A  cells,  a  number  of  single  and  multiple  mutants 
in  UBC  genes  were  examined  (Figure  1).  We  found 
that  the  elimination  of  both  UBC2  and  UBC4  virtually 
abolished  the  import  of  peptides  (Figure  lA,  B  and 
E-G).  Elimination  of  Ubc4p,  one  of  the  more  abundant 
E2  enzymes  (Bachmair  et  ai,  1986;  Jentsch,  1992),  had 
previously  been  noticed  to  decrease  slightly  the  activity 
of  the  N-end  rule  pathway  (Bartel,  1990).  Cells  lacking 
either  Ntalp  or  Ate  Ip — the  ‘upstream’  components  of  this 
pathway — were  also  examined  and  found  unimpaired  in 
the  import  of  peptides,  in  contrast  to  ubrlA  and  ubc2A 
ubc4A  cells  (Figure  lA). 

Identification  of  Cup9p  as  a  negative  regulator  of 
peptide  import 

Previous  work  (Alagramam  et  ai,  1995)  has  shown  that 
deletion  of  UBRI  greatly  reduces  the  level  of  PTR2  mRNA, 
which  encodes  the  transmembrane  peptide  transporter.  This 
result,  and  our  observation  that  peptide  import  requires 
the  presence  of  at  least  one  of  two  specific  ubiquitin- 
conjugating  enzymes  (Figure  1),  suggested  a  model  in 
which  expression  of  the  Ptr2p  transporter  is  regulated  by 
a  short-lived  repressor  that  is  degraded  by  the  N-end  rule 
pathway.  In  cells  lacking  UBRL  the  repressor  would  be 
expected  to  accumulate,  thereby  blocking  peptide  import. 
One  prediction  of  this  model  is  that  inactivation  of  this 
repressor  would  bypass  the  requirement  for  Ubrlp  in 
peptide  import.  A  screen  for  such  ‘bypass’  mutations  (see 
Materials  and  methods)  yielded  199  recessive  isolates,  of 
which  101  defined  one  complementation  group,  termed 
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Fig.  2.  Cup9p  is  a  repressor  of  the  PTR2  gene.  (A)  Expression  of  the  S.cerevisiae  PTR2  gene  in  different  genetic  backgrounds.  Equal  amounts  of 
total  RNA  isolated  from  different  strains  were  analyzed  by  Northern  hybndization  (see  Materials  and  methods),  using  the  PTR2  (peptide  transporter) 
and  ACTl  (actin)  genes  as  ^“P-labeied  probes.  Lane  a.  JD52  {CUP9  UBRl)  (wild-type)  transformed  with  pCB201  (empty  vector).  Lane  b,  JD55 
{CUP9  ubrIA)  transformed  with  pCB20l.  Lane  c,  JD52  {CUP9  UBRJ)  transformed  with  the  high-copy  plasmid  pCB209  that  expressed  CUP9  from 
its  natural  promoter.  Lane  d.  CBY18  icup9A  UBRI).  Lane  e,  CBYI6  icup9A  uhrlA).  Lane  f,  CBY21  iCUP9  UBRI  ptrZA).  Lanes  g-i.  same  as  lanes 
a-c.  but  a  longer  autoradiographic  exposure  to  highlight  the  tight  repression  of  PTR2  by  Cup9p  in  ubrJA  cells  (b.  h)  and  the  difference  between 
levels  of  PTR2  mRNA  in  the  wild-type  cells  (a.  g)  and  their  counterpans  that  overexpressed  Cup9p  (c.  i).  (B)  Cup9p  specifically  binds  to  a  site  in 
the  ?PTR2  promoter.  A  gel  shift  assay  with  Cup9-Hf,,  poly-dl  dC  and  ^^P-labeled  DNA  fragments  of  the  PTR2  promoter  hybridization  (see  Materials 
and  methods).  Lanes  a-c,  with  a  fragment  (-1  to  -Ml)  proximal  to  the  inferred  stan  codon  of  the  PTR2  ORE.  Lanes  d-f.  same  as  lanes  a-c,  but 
with  a  more  distal  DNA  fragment  (-448  to  -897).  Concentrations  of  Cup9-Hf,  (in  pg/ml)  are  indicated  above  the  lanes. 


sub  I  (suppressor  of  a  block  to  peptide  import  in  ubrJA). 
In  agreement  with  the  modeTs  prediction,  sub  I  mutants 
acquired  the  ability  to  express  PTR2  in  the  absence  of 
UBRI  (Figure  2  A,  lane  b  versus  lane  e).  The  sub  I 
locus  was  cloned  by  complementation  (see  Materials  and 
methods),  and  was  found  to  be  the  CUP9  gene. 

CUP9  was  originally  identified  by  Knight  et  ai  (1994) 
as  a  gene  whose  disruption  impairs  the  copper  resistance 
of  S.cerevisiae  growing  on  lactate,  a  non-fermentable 
carbon  source.  Under  these  conditions,  Cup9p  plays  a 
major  (but  mechanistically  obscure)  role  in  copper 
homeostasis  (Knight  et  ai,  1994).  CUP9  encodes  a  35  kDa 
protein  that  contains  a  homeodomain,  an  --60-residue 
helix-tum-helix  DNA-binding  motif  present  in  many 
eukaryotic  regulatory  proteins  (Wolberger,  1996).  Outside 
the  homeodomain  region,  the  sequence  of  Cup9p  is  not 
similar  to  sequences  in  databases. 

To  verify  that  CUP9  and  SUB  I  were  the  same  gene, 
complementation  tests  were  carried  out.  Two  independ¬ 
ently  derived  ubrIA  cup9::LEU2  strains  (CBY16  and 
CBY17)  were  crossed  to  ubrlA  subl-1  (CBY15),  and  the 
resulting  diploids  (CBY23  and  CBY24,  respectively)  were 
tested  for  their  ability  to  import  dipeptides  (see  Materials 
and  methods).  As  would  be  expected  of  allelic  loci, 
cup9::LEU2  and  subl-1  failed  to  complement  one  another: 
both  diploid  strains  remained  import-competent  (data 
not  shown).  In  another  test,  CBY23  and  CBY24  were 
sporulated,  and  the  segregants  were  analyzed  for  the 
presence  of  the  LEU2  gene  (integrated  at  the  CUP9  locus) 
and  for  the  ability  to  import  peptides.  Among  the  eight 
tetrads  tested,  LEU2  was  present  in  two  of  the  four 
segregants.  whereas  all  four  segregants  were  invariably 
import-competent,  a  pattern  expected  if  CUP9  and  SUB  I 
were  one  and  the  same  gene. 


To  examine  the  regulation  of  peptide  import  by  Cup9p 
and  Ubrlp,  con  genic  S.cerevisiae  strains  that  lacked, 
expressed  or  overexpressed  Cup9p  and/or  Ubrlp  were  con¬ 
structed  and  assayed  for  peptide  import  by  growth  on  select¬ 
ive  media.  Suspensions  of  cells  (auxotrophic  for  lysine) 
were  serially  diluted  and  plated  on  either  rich  media,  min¬ 
imal  media  lacking  lysine  and  containing  Lys-Ala  dipeptide 
(selecting  for  peptide  import),  or  minimal  media  containing 
the  toxic  dipeptide  Leu-Eth  (selecting  against  peptide 
import).  All  strains  grew  at  comparable  rates  on  rich  media 
(Figure  3A).  On  minimal  media  that  supplied  the  essential 
lysine  as  the  Lys-Ala  dipeptide,  the  CUP9  UBRI  (wild- 
type),  cup9A  UBRI  and  cup9A  ubrIA  strains  grew  at  com¬ 
parable  rates  (Figure  3B),  whereas  the  CUP9  ubrIA  strain 
failed  to  grow  (Figure  3B),  in  agreement  with  the  observa¬ 
tion  that  UBRI  is  required  for  peptide  import  (Figure  1). 
Opposite  growth  patterns  were  observed  on  media  con¬ 
taining  toxic  dipeptide  (Figure  3C).  Comparison  of  the  data 
in  Figure  3  with  PTR2  mRNA  levels  (Figure  2A)  suggested 
that  CUP9  exerts  its  effect  on  the  import  of  peptides  by 
repressing  transcription  of  PTR2.  For  example,  wild-type 
(CUP9  UBRI)  cells  overexpressing  Cup9p  exhibited 
reduced  levels  of  PTR2  mRNA  (Figure  2A)  and  decreased 
sensitivity  to  toxic  dipeptide  (Figure  3C),  whereas  strains 
lacking  CUP9  {cup9A  UBRI  and  cup9A  ubrIA)  overex¬ 
pressed  PTR2  (Figure  2A)  and  were  hypersensitive  to  toxic 
dipeptide  (Figure  3C). 

Cup9p  is  a  repressor  of  the  PTR2  gene 

To  address  the  mechanism  of  repression  by  Cup9p,  we 
asked  whether  purified  Cup9p  (see  Materials  and  methods) 
could  selectively  bind  to  specific  regions  of  the  PTR2 
promoter.  Gel  shift  assays  in  the  presence  of  poly-dFdC 
competitor  DNA  were  performed  with  labeled  DNA 
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Fig.  3.  Relative  capacity  for  peptide  import  in  congenic  S.cerevisiae 
strains  that  contained,  lacked  or  overexpressed  Cup9p  and/or  Ubrlp. 
Serial  dilutions  of  the  indicated  strains  were  deposited  by  a  48-pin 
applicator  onto  either  nch  (YPD)  medium  (A),  minimal  medium 
containing  66  pM  Lys-Ala  dipeptide  as  the  sole  source  of  lysine  (B), 
or  minimal  medium  containing  both  lysine  (at  110  pM)  and  the  toxic 
dipeptide  Leu-Eth  (at  55  pM)  (C).  The  plates  were  incubated  at  30®C 
for  1-2  days.  The  relevant  genetic  loci  are  shown  on  the  left.  The 
'UBRL  overexpressed  CUP9'  strain  (JD52-2p-CUP9)  carried  the  high- 
copy  plasmid  pCB209  that  expressed  Cup9p  from  its  natural  promoter. 


fragments  of  the  PTR2  promoter  and  His6“tagged  Cup9p 
(Cup9p-H6)  purified  from  E.coli.  Cup9p-H6  bound  to  a 
site  within  a  distal  region  of  the  PTR2  promoter  (positions 
-448  to  -897  relative  to  the  inferred  CUP9  start  codon), 
but  did  not  bind  to  the  proximal  region  of  the  PTR2 
promoter  (positions  - 1  to  -447)  under  the  same  condi¬ 
tions  (Figure  2B). 

The  transcriptional  repressor  function  of  Cup9p  was 
further  suggested  by  the  finding  that  the  co-repressor 
complex  Tuplp/Ssn6p  plays  a  role  in  the  control  of  peptide 
import.  The  Tuplp/Ssn6p  complex  inhibits  transcription 
of  many  yeast  genes  through  interactions  with  gene- 
specific  DNA-binding  repressors  such  as  Mata2p  (Chen 
et  ai^  1993;  Smith  et  ai,  1995),  a  homeodomain  homolog 
of  Cup9p.  We  found  that  most  of  our  sub  mutants  that 
were  not  CUP9  mutants  could  be  complemented  by  low- 
copy  plasmids  bearing  SSN6  or  TUPI  (G.Tumer,  S.Saha 
and  A.  Varshavsky,  unpublished  data).  In  addition,  deletion 
of  SSN6,  like  deletion  of  CUP9,  restored  the  ability  of 
ubrlA  cells  to  import  peptides  (data  not  shown). 

Ubr1p-dependent  degradation  of  Cup9p 

The  fact  that  deletion  of  CUP9  renders  PTR2  transcription 
independent  of  Ubrlp  (Figure  2A),  and  the  observation 
that  overexpression  of  Ubrlp  enhances  the  import  of 
peptides  in  Cup9p-expressing  strains  (data  not  shown) 
suggested  that  the  N-end  rule  pathway  regulates  peptide 


import  by  targeting  Cup9p  for  degradation.  To  test  this 
conjecture,  we  carried  out  pulse-chase  experiments  with 
a  C-terminally  FLAG-tagged  Cup9p  in  UBRI  and  ubrlA 
cells.  Cup9p-FLAG  was  a  very  short-lived  protein  (/1/2 
~5  min)  in  UBRI  cells  (Figure  4).  By  contrast,  Cup9p 
was  much  longer-lived  (ri/2  >30  min)  in  ubrlA  cells 
(Figure  4).  Degradation  of  Cup9p  was  also  found  to 
depend  upon  UBC2  and  UBC4  (data  not  shown),  in 
agreement  with  the  observation  that  a  ubc2A  ubc4A  double 
mutant  failed  to  import  dipeptides  (Figure  IG).  The 
residual  instability  of  Cup9p  in  ubrlA  cells  (Figure  4) 
suggested  the  presence  of  a  second,  Ubrlp- independent 
degron;  this  pattern  is  reminiscent  of  another  homeo¬ 
domain  repressor,  Mata2p,  which  also  contains  at  least  two 
distinct  degradation  signals  (Hochstrasser  and  Varshavsky, 
1990;  Chen  et  al,  1993). 

Ubrlp  recognizes  engineered  N-end  rule  substrates 
through  their  destabilizing  N-terminal  residues 
(Varshavsky,  1 996).  To  determine  the  N-terminal  residue 
of  Cup9p,  we  overexpressed  and  purified  Cup9p-FLAG 
from  ubrIA  S.cerevisiae  and  subjected  it  to  N-terminal 
sequencing.  Cup9p-FLAG  was  found  to  have  a  blocked 
(presumably  acetyiated)  N-terminus  (see  Materials  and 
methods),  suggesting  that  Ubrlp  targets  Cup9p  through  a 
degron  distinct  from  the  N-degron.  Independent  evidence 
for  this  conjecture  was  produced  through  the  analysis  of 
GST-Cup9p-ha2,  a  fusion  of  glutathione  transferase  (GST) 
and  C-terminally  ha-tagged  Cup9p.  Pulse-chase  analysis 
of  GST-Cup9p-ha2  revealed  that  the  fusion  protein  was 
nearly  as  short-lived  in  vivo  as  the  N-terminally  unmodified 
Cup9p-FLAG  (Figure  4,  lanes  b-e;  compare  with  Figure 
5.  lanes  b-e).  If  Cup9p  were  targeted  for  process! ve 
degradation  through  a  destabilizing  N-terminal  residue,  a 
preliminary  proteolytic  cleavage(s)  of  Cup9p  would  be 
required  to  expose  such  a  residue  (Varshavsky,  1996).  In 
the  case  of  GST-Cup9p-ha2,  this  cleavage  would  generate 
a  proteolytic  fragment  consisting  of  GST  and  an  N-terminal 
portion  of  Cup9p  preceding  the  cleavage  site.  Since  free 
GST  has  been  found  to  be  long-lived  when  expressed  in 
yeast  (data  not  shown),  accumulation  of  such  a  proteolytic 
fragment  would  be  expected  to  accompany  the  degradation 
of  GST-Cup9p-ha2. 

Pulse-chase  analysis  of  GST-Cup9p-ha2,  using  gluta¬ 
thione-agarose  beads  to  isolate  GST-containing  proteins 
(see  Materials  and  methods),  indicated  that  the  degradation 
ot  GST-Cup9p-ha2  by  the  N-end  rule  pathway  was  not 
accompanied  by  the  appearance  of  a  fragment  containing 
the  N-terminal  GST  moiety  (Figure  5).  This  finding 
strongly  suggested  that  Cup9p  bears  an  ‘intemaT  degron 
recognized  by  Ubrlp.  Additional  support  for  this  conjec¬ 
ture  was  provided  by  truncation  analysis  of  a  Cup9p- 
DHFR-myc  fusion  protein.  These  experiments  indicated 
that  the  N-terminal  81  residues  of  the  306-residue  Cup9p 
are  dispensable  for  its  Ubr  1  p-dependent  degradation, 
strongly  suggesting  that  the  relevant  degradation  signal 
resides  in  the  C-terminal  two-thirds  of  Cup9p  (C.Byrd, 
I. Davydov  and  A.  Varshavsky,  unpublished  data).  Although 
these  results  are  fully  consistent  with  the  presence  of  an 
internal  Ubr  I  p-dependent  degron  in  Cup9p,  there  remains 
the  less  parsimonious  possibility  that  Cup9p  is  degraded 
via  rran,v-targeting  (Johnson  et  ai.  1990).  In  this  process, 
the  two  determinants  of  the  N-degron  (a  destabilizing 
N-terminal  residue  and  a  ubiquitin-accepting  internal  Lys 
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Fig.  4.  Degradation  of  Cup9p  by  the  N-end  rule  pathway.  Saccharomyces  cerevisiae  strains  carrying  pCB2I0,  a  low-copy  plasmid  that  expressed 
Cup9p-FLAG  from  the  ?cupi)  promoter,  were  labeled  at  30°C  with  [^^Slmethionine/cysteine  for  5  min.  followed  by  a  cha.se  for  0,  5.  15,  30  and 
60  min.  preparation  of  extracts,  immunoprecipitation  of  Cup9p-FLAG  with  a  monoclonal  anti-FLAG  antibody.  SDS-PAGE  and  autoradiography/ 
quantitation  (see  Matenals  and  methods).  Lane  a.  JD55  (ubrIA)  cells  that  carried  pCB200  (vector  alone).  Lanes  b-f.  JD52  {UBRD  (wild-type)  cells 
that  carried  pCB210.  expressing  Cup9p-FLAG.  Lanes  g-k,  same  as  lanes  b-f.  but  with  JD55  iuhrlA)  cells. 


residue)  reside  in  two  different  polypeptides,  whose  inter¬ 
action  yields  an  active  N-degron,  leading  to  ubiquitinyl- 
ation  and  degradation  of  the  polypeptide  bearing  the  Lys 
residue  (Johnson  et  ai,  1990).  At  present,  little  is  known 
about  the  protein  ligands  of  Cup9p,  save  for  the  likely 
possibility  that  Cup9p  interacts  with  the  Tuplp/Ssn6p 
repressor  complex  (see  above),  similarly  to  the  previously 
established  interaction  of  this  complex  with  Mata2,  a 
homeodomain-containing  homolog  of  Cup9p  (Smith 
et  ai,  1995). 


Discussion 

The  discovery  that  Ubrlp  controls  the  import  of  peptides 
through  degradation  of  the  Cup9p  repressor  (Figure  6) 
identifies  the  first  clear  physiological  function  of  the  N-end 
rule  pathway.  The  existence  of  mammalian,  plant  and 
bacterial  homologs  of  the  yeast  Ptr2p  transporter  suggests 
that  the  import  of  peptides  in  these  organisms  may  also 
be  regulated  by  the  N-end  rule  pathway. 

Why  was  this  pathway,  rather  than  another  Ub-depend- 
ent  proteolytic  system,  recruited  in  the  course  of  evolution 
for  the  regulation  of  peptide  import?  A  plausible  explan¬ 
ation  is  suggested  by  the  ability  of  Ubrlp  to  bind  peptides 
bearing  destabilizing  N-terminal  residues  (Varshavsky, 
1996).  Since  more  than  half  of  the  20  amino  acids  are 
destabilizing  in  the  yeast  and  mammalian  N-end  rules 
(Varshavsky,  1996),  a  significant  fraction  of  imported 
peptides  would  be  expected  to  compete  with  Cup9p  for 
binding  to  Ubrlp.  This  competition  may  decrease  the  rate 
of  Cup9p  degradation.  [Dipeptides  added  to  a  culture  of 
S. cerevisiae  have  been  shown  to  inhibit  the  N-end  rule 
pathway  (Baker  and  Varshavsky,  1991).]  The  ensuing 
increase  in  the  level  of  Cup9p  repressor  would  in  turn 
decrease  the  production  of  the  Ptr2p  transporter,  diminish¬ 
ing  the  rate  of  peptide  import.  This  ‘peptide-sensing’ 
negative  feedback  loop  would  maintain  the  intracellular 
concentration  of  short  peptides  within  a  predetermined 
range — potentially  a  useful  feature,  made  possible  by  the 
substrate-binding  properties  of  Ubrlp.  Experiments  to 
verify  this  model  are  under  way. 

Another  interesting  aspect  of  the  Ubrlp-Cup9p-Ptr2p 


wild-type  ubrlA 

Chase  (min):  0  5  10  30  0  5  10  30 

200- 
97.4- 


69- 


46-  ■'4' 


-  GST-Cup9p  - 


abode  f  g  h  I 

Fig.  5.  Degradation  of  GST-Cup9p-ha2  is  not  preceded  by  a  processing 
that  releases  a  GST-containing  fragment.  Saccharomyces  cerevisiae 
strains  carrying  pCB120,  a  low-copy  plasmid  that  expressed  GST- 
Cup9p-ha2  from  the  Pgali  promoter,  were  labeled  at  30''C  with 
[^^S]methionine/cystetne  for  5  min,  followed  by  a  chase  for  0,  5,  10 
and  30  min.  preparation  of  extracts,  isolation  of  GST-Cup9p-ha2  on 
glutathione-agarose  beads,  SDS-PAGE  and  autoradiography/ 
quantitation  (see  Materials  and  methods).  Lanes  b-e,  JD52  {UBRI) 
(wild-type)  cells  expressing  GST-Cup9p-ha2.  Lanes  f-i.  same  as  lanes 
b-e,  but  with  JD55  (ubrIA)  cells.  Lane  a,  molecular  mass  markers 
(in  kDa). 


regulatory  circuit  is  its  possible  relevance  to  copper 
homeostasis.  In  addition  to  functioning  as  a  repressor  of 
peptide  transport,  Cup9p  also  contributes  to  the  resistance 
of  S. cerevisiae  to  copper  toxicity  during  growth  on  lactate, 
a  non-fermentable  carbon  source  (Knight  et  ai,  1994). 
Copper  homeostasis  is  mediated  by  a  complex  set  of 
circuits,  some  of  which  also  regulate  iron  metabolism 
(Zhou  and  Thiele,  1993;  Ooi  et  ai,  1996).  The  double 
role  of  Cup9p  in  regulating  both  peptide  import  and  copper 
homeostasis  may  signify  a  physiological  connection 
between  these  seemingly  unrelated  processes. 
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Fig.  6.  A  model  for  regulation  of  peptide  impon  in  S.cerevisiae.  The 
expression  of  PTR2.  which  encodes  a  transmembrane  peptide 
transponer.  is  regulated  by  the  short-lived,  homeodomain-containing 
transcriptional  repressor  Cup9p.  The  concentration  of  Cup9p  is 
controlled  in  part  through  its  degradation  by  the  N-end  rule  pathway, 
whose  targeting  components  include  Ubrlp  and  either  Ubc2p  or 
Ubc4p  {see  the  main  text). 


Our  work  adds  Cup9p  to  the  list  of  short-lived  regulatory 
proteins  whose  degradation  is  mediated  by  the  ubiquitin 
system.  Many  of  these  proteins  are  negative  regulators.  For 
example.  Mata2p,  a  homeodomain-containing  homolog  of 
Cup9p,  controls  the  mating  type  of  S.cerevisiae  (a  or  a) 
through  repression  of  a-specific  genes  (Herskowitz,  1989). 
Mata2p  appears  to  be  constitutively  short-lived  in  haploid 
cells  (Hochstrasser  and  Varshavsky,  1990).  Therefore, 
cessation  of  Mata2p  synthesis  upon  the  conversion  of  an 
a  cell  into  an  a  cell  results  in  rapid  disappearance 
of  Mata2p  and  the  establishment  of  a-specific  circuits 
(Hochstrassser,  1996).  Progression  of  the  cell  cycle  is  also 
controlled  by  short-lived  negative  regulators,  in  particular 
by  Sic  Ip,  an  inhibitor  of  CDK,  the  cyclin-dependent 
kinase  (Schwob  et  al..  1994).  In  this  case,  however,  a 
rapid  ‘on-switch’  is  based  on  the  phosphorylation-induced 
degradation  of  the  previously  stable  Sic  Ip  (King  et  al., 
1996).  Whether  the  Cup9p  repressor  (Figure  6)  is  constitu¬ 
tively  short-lived  or  whether  its  stability  is  regulated  by 
external  conditions  such  as,  for  example,  different  nitrogen 
sources,  remains  to  be  determined. 

The  finding  that  Cup9p  lacks  a  destabilizing  N-terminal 
residue  indicates  that  Ubrlp,  the  recognition  component 
of  the  N-end  rule  pathway,  is  able  to  target  substrates 
bearing  either  internal  degradation  signals  or  N-degrons. 
The  Ga  subunit  of  the  G  protein — the  other  known 
physiological  substrate  of  Ubrlp  in  S.cerevisiae — also 
lacks  a  destabilizing  N-terminal  residue  (Madura  and 
Varshavsky,  1994).  Thus,  a  substantial  fraction  of  naturally 
short-lived  proteins  targeted  by  the  N-end  rule  pathway 
may  bear  internal  degrons  rather  than  N-degrons,  a  pos¬ 
sibility  that  the  identification  of  other  physiological  sub¬ 
strates  of  this  pathway  will  address. 

Materials  and  methods 

Strains,  media  and  genetic  techniques 

Saccharomyces  cerevisiae  strains  were  grown  in  rich  (YPD)  medium 
containing  29c  peptone.  I  %  yeast  extract  and  29c  glucose  or  in  synthetic 
yeast  media,  containing  0.67%  yeast  nitrogen  base  without  amino  acids 
(Difco),  auxotrophic  nutrients  at  concentrations  specified  by  Sherman 
et  al.  (1986)  and  either  29c  glucose  (SD  medium),  2%  raffinose  (SM- 
raffinose  medium)  or  29c  galactose  (SM-galactose  medium)  as  carbon 
sources.  SHM-glucose  medium  used  in  halo  assays  was  prepared 
according  to  Island  et  al.  (1987)  and  was  identical  to  SD  medium  except 
that  it  lacked  methionine  and  contained  allantoin  ( 1  mg/ml)  and  yeast 
nitrogen  base  (Difco,  1.7  mg/ml)  without  amino  acids  and  ammonium 


sulfate.  Escherichia  coli  strain  DH5a  was  used  for  plasmid  propagation 
and  cloning  steps.  For  induction  of  the  Pqali  promoter,  cells  were  grown 
to  an  /I500  of  0.5-1  in  SM-raffinose  medium,  pelleted  and  transferred 
to  SM-galactose  medium  for  3  h. 

Halo  and  dilution  assays 

Peptide  import  was  assayed  using  the  halo  and  dilution  methods.  For 
halo  assays  (Island  et  al.,  1987),  cells  were  grown  to  an  ^^(X)  of  -1  in 
SHM-glucose  medium  with  auxotrophic  supplements.  Cells  were  pelleted 
by  centrifugation  and  resuspended  in  water  to  -5X10^  cells/ml.  Then 
0.1  ml  of  cell  suspension  was  mixed  with  10  mi  of  0.8%  noble  agar 
(Difco)  at  55°C.  and  spread  on  plates  containing  20  ml  of  SHM  medium. 
Sterile  filter  disks  containing  the  toxic  dipeptide  L-leucyl-L-ethionine 
(Leu-Eth;  5  pmol)  were  placed  in  the  middle  of  each  plate,  followed  by 
incubation  at  30°C  for  1-2  days. 

For  dilution  as.says.  strains  were  grown  (under  selection  for  plasmids) 
in  SD  medium  to  an  A^,(x)  of  -1.  Cells  from  each  .sample  ( 1.5x  lOh  were 
spun  down  and  resuspended  in  1  ml  of  water.  The  samples  were  serially 
diluted  by  4-fold  in  a  microtiter  plate  (150  pl/well)  to  generate  eight 
different  initial  concentrations  of  cells  that  ranged  from  1.5x  10^  to  915 
cells/ml.  The  suspensions  were  spotted  to  various  media,  using  a  48-pin 
applicator.  The  plates  were  incubated  at  30“C  for  1-2  days. 

Leu-Eth  was  synthesized  using  standard  methods  of  organic  chemistry. 
Briefly.  L-ethionine  methyl  ester  (Eth-OMe)  was  produced  from 
L-ethionine  and  methanol.  Eth-OMe  was  then  coupled  with  N-tert- 
butoxy-carbonyl-L-leucine-/7-nitrophenyl  ester  (N-r-BOC-L-leucine- 
PNP).  yielding  N-/-BOC-L-leucyl-L-ethionine  methyl  ester,  which  was 
punhed  by  flash  chromatography  on  a  silica  column.  N-/-BOC-L-leucyl- 
L-ethionine  methyl  ester  was  then  converted  into  N-r-BOC-L-leucyl-L- 
ethionine  by  treatment  with  KOH.  N-r-BOC-L-Ieucyl-L-ethionine  was 
deprotected  with  trifluoroacetic  acid,  yielding  Leu-Eth. 

A  screen  for  import^competent  mutants  in  the  ubrIA 
background 

Thirty  5-ml  cultures  of  JD83-1 A  (leuI-S  ubrIA),  auxotrophic  for  leucine, 
were  grown  to  an  of  -“I-  iiod  -1.5xl0’^  cells  from  each  culture 
were  plated  onto  SD  medium  lacking  leucine  and  histidine  but  containing 
the  leucyl-histidine  dipeptide  at  0.23  pM.  Plates  were  incubated  at  30°C 
for  2  days,  selecting  for  mutants  able  to  grow  on  this  medium.  Of  the 
320  sub  mutants  (‘suppressors  of  a  block  to  peptide  import  in  ubrIA') 
thus  obtained.  199  were  found  by  complementation  tests  to  be  clearly 
recessive.  Among  these,  101  mutants  belonged  to  one  complementation 
group,  termed  sub  I. 

Isolation  of  the  CUPS  (SUB1)  gene 

A  50  ml  culture  ot  S.cerevisiae  CBY15  {ubrIA  subl~l)  was  grown  to 
an  A^joo  of  - 1  in  SHM-glucose.  Cells  were  made  competent  with  lithium 
acetate  (Ausubel  et  al.,  1992).  and  25  0.1-ml  samples  (-7X10^  cells/ 
ml)  were  transformed  with  a  yeast  genomic  DNA  library  (American 
Type  Culture  Collection;  #77164)  in  the  TRPI,  C£’/V6-based  vector 
pRS200  (Sikorski  and  Hieter,  1989).  The  cells  were  pelleted,  and  each 
sample  was  resuspended  in  1.5  ml  of  SHM-glucose  and  incubated  at 
30°C  for  3  h.  The  cells  were  pelleted  again,  each  sample  was  resuspended 
in  0.1  ml  of  water,  added  to  10  ml  of  0.8%  Noble  agar  at  55‘’C,  and 
spread  onto  SHM-glucose  plates  lacking  Trp  and  containing  Leu-Eth  at 
37  pM.  Of  the  -2.5X10‘^  Trp^  transformants  plated,  14  could  grow  in 
the  presence  of  Leu-Eth.  Of  these,  two  yielded  the  library -derived 
plasmids,  pSUBl-1  and  pSUBl-6,  that  complemented  the  sub  I  mutation 
of  CBY15.  The  insert  of  pSUBl-6  contained  the  insert  of  pSUBl-1. 
Partial  sequencing  of  the  -8.2  kb  insert  of  pSUBl-1  identified  it  as  a 
region  of  the  S.cerevisiae  chromosome  XVI.  Deletion  analysis  (not 
shown)  localized  the  complementing  activity  to  an  -2.8  kb  //mdlll- 
Kpnl  fragment,  whose  sequence  revealed  the  presence  of  two  ORFs 
{CIJP9  and  SCYPL178W).  Further  deletion  analysis  (not  shown)  local¬ 
ized  the  complementing  activity  to  the  previously  isolated  (Knight  et  ai, 
1994)  CVP9  gene.  Verification  that  Cl!P9  and  SUBI  were  one  and  there 
same  gene  was  carried  out  as  described  in  Results. 

Construction  of  null  cupS  mutants 

The  ~2.8  kb  HindUl-Kpnl  genomic  DNA  fragment  containing  CUP9 
was  ligated  to  HindlU~Kpn\-c\ii  pRS306ASpel.  yielding  pCB117. 
pRS306ASpel  was  derived  from  the  f//?AJ-bearing  pRS306  (Sikorski 
and  Hieter,  1989)  through  elimination  of  its  Spel  site.  The  CUP9  ORF 
was  disrupted  by  inserting  an  -2  kb,  bluni-ended,  L£'(/2-containing  Sail 
fragment  from  pJJ283  into  the  Spel  site  of  CUP9,  yielding  pCB119,  in 
which  the  LEU2  and  CUP9  ORFs  were  oriented  in  opposite  directions. 
An  -5  kb  //mdlll-PvwlI  fragment  of  pCBl  19  containing  the  cup9::LEU2 
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Table  I.  Saccharomyces  cerevisiae  strains  used  in  this  study 

Strain  Genotype  References 


DF5 

EJl 

EJ2 

EJ3 

EJIO 

EJY102 

EJY105 

EJY106 

MHY495 

MHY503 

MHY507 

MHY550 

MHY552 

MHY554 

MHY557 

MHY570 

MHY599 

MHY601 

YW055 

YPH500 

BBY67 

KJVI207-1 

JD34 

JD51 

JD52 

JD53 

JD55 

JD83-1A 

RJD549 

RJD795 

SGY6 

CBY4 

CBY5 

CBY6 

CBY7 

CBY8 

CBY9 

CBYIO 

CBYll 

CBY15 

CBY16 

CBY17 

CBY19 

CBY23 

CBY24 


MATa/MATa  trpl-l/trpJ-I  ura3-52/ura3-52  his3-A200/his3-A200  Ieii2-3J}2/Ieu2-3J12  lys2-801/ 
lys2-801  gal/gal 

MATql  ubc5A::LEU2  trpl-l  ura3-52  his3~A200  Ieu2-3J12  lys2~80l  gal 

MATol  uhclA::URA3  trpl-l  ura3‘52  his3-A200  Ieu2-3JI2  lys2'80l  gal 

MATa  ubc4A::HIS3  trpl-l  ura3-52  his3-A200  Ieu2-3JI2  Iys2-80I  gal 

MATa  ubc4A::HIS3  ubc5A::LEU2  trpI-I  ura3-52  his3-A200  Ieu2-3JI2  Iys2-80I  gal 

MATa  ubc2A::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 

MATa  uhc2A::LEU2  ubc4A::HlS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 

MATa  ubc2A::LEU2  ubc4A::HlS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-80l  gal 

MATa  ubc6-Al ::HIS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubc4-Al ::HIS3  uhc6-Al ::H1S3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 

MATa  ubc7::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 

MATa  uhc4-A2::TRPl  uhc5-Al ::LEU2  ubc6-Al ::HIS3  trpl-l  ura3-52  his3-A200  Ieii2-3J!2 
lys2-80l 

MATa  ubc6-Al ::HIS3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  Ieit2-3J12  lys2-80l 

MATa  ubc4-Al::HIS3  uhc7::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubc4-Al;:HIS3  ubc6-Al ::HIS3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  leu2-3.112  lys2-801 

MATa  ubc4-A2::TRPl  uhc5-Al ::LEU2  ubc6-Al ::H1S3  ubc7::LEU2  trpl-l  ura3-52  his3-A200 

Ieu2-3J12  lys2-80l 

MATa  pas2  trp  ura3-52  adel  leu2-3 

MATa  ubc8::URA3  trpl-l  ura3-l  ade2-l  his3-ll  Ieu2-3J12  can  1 -100 

MATa  ubc9-l  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  trpl-A63  ura3-52  ade2-l0l  his3-A200  Ieu2-Al  lys2-80l 

MATa  ubc2A::LEU2  trpl-A63  ura3-52  ade2-l0l  his3-A200  Ieu2-Al  lys2-80l 

MATa  atel A::TRPl  trpl-A63  ura3-52  ade2-10l  his3-A200  Ieu2-Al  lys2-801 

MATa  ubc2A::URA3  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  lys2-80l 

MATalMATa  trpl -A63/trpl -A63  ura3-52/ura3-52  his3-A200/his3-A200  Ieu2-3J  12/Ieu2-3J  12 

lys2-80lAys2-801 

MATa  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 

MATa  trpTA63  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 

MATa  ubrl A::HIS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 

MATa  ubrlA::HlS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 

MATa  cdc34-l  trpl  ura3-52  leu2-3 

MATa  cdc34-2  trpl  ura3-52  leu2-3 

MATa  ntalA::TRPl  trpl-A63  ura3-52  ade2-l0l  /i/ji-A200  Ieu2-3J12  lys2-801 
MATa  ubc2A::URA3  ubcSA: :LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 
MATa  ubc2A::LEU2  ubc6-Al ::H1S3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  ubc2A::URA3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-80l 
MATa  ubc2A::LEU2  ubc8::VRA3  trpl-l  ura3-l  ade2-l  his3-ll  Ieu2-3J12  canl-lOO 
MATa  ubc2A::LEU2  ubcl A::URA3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-80l  gal 
MATa  ubc2A::LEU2  iibc9-l  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  ubc2A::LEU2  pas2  trp  ura3-52  adel  leu2-3 
MATa  ubc2A::LEU2  cdc34-l  trpl  ura3-52  leu2-3 

MATa  subl-l  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  cup9::LEU2  ubrlA::HlS3  trpl-A63  ura3-52  his3-A200  Ieii2-3J12  lys2-801 
MATa  cup9::LEU2  ubrlA::HlS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  cup9::LEU2  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATalMATa  subl-l/cup9A::LEU2  ubrlA::HIS3/ubrlA::HIS3  trpl -A63/trpl -A63  ura3-52/ura3-52 
his3-A200/his3-A200  Ieu2-3J12/Ieu2-3J12  lys2-801Ays2-801 

MATalMATa  subl-l/cup9A::LEU2  ubrlA::HIS3/ubrlA::HIS3  trpl -A63 Arp  1 -A63  ura3-52Aira3-52 
his3-A200Atis3-A200  leu2-3,112Aeu2-3J12  lys2-801Ays2-801 


Finley  et  ai  (1987) 

Derivative  of  DF5^ 
Derivative  of  DFS"^ 
Derivative  of  DF5“ 
Derivative  of  DF5^ 
Derivative  of  DF5“ 
Derivative  of  DF5“ 
Derivative  of  DF5^ 
Derivative  of  DF5^ 
Derivative  of  DF5^ 
Derivative  of  DF5^ 
Derivative  of  DF5^ 

Derivative  of  DF5^ 
Derivative  of  DFS*’ 
Derivative  of  DF5*’ 
Denvative  of  DF5^ 


Chen  et  al.  ( 1993) 

Chen  et  al.  ( 1993) 

Denvative  of  DF5*-’ 

Sikorski  and  Hieter  (1989) 
Derivative  of  YPH500‘^ 
Derivative  of  YPHSOO^ 
Derivative  of  YPH500^ 

Dohmen  et  ai  ( 1995) 

Johnson  et  ai  (1995) 

Dohmen  et  al.  (1995) 

Madura  and  Varshavsky  (1994) 
Derivative  of  JD5P 
R.Deshaies^ 

R. Deshaies^ 

S.  Grigoryev^ 

Denvative  of  EJl 
Derivative  of  MHY495 
Derivative  of  MHY507 
Derivative  of  MHY601 
Denvative  of  EJ2 
Derivative  of  YW055 
Derivative  of  MHY599 
Derivative  of  RJD549 
Derivative  of  JD83-1A 
Derivative  of  JD55 
Derivative  of  JD55 
Derivative  of  JD52 

Produced  by  mating  CBY15  and 
CBY16 

Produced  by  mating  CBY15  and 
CBY17 


'‘Johnson  et  al.  (1992,  1995).  A  gift  from  E.Johnson,  the  Rockefeller  University,  New  York,  NY  10021-6399.  USA. 

^Chen  et  ai  (1993).  A  gift  from  M.Hochstrasser,  Department  of  Biochemistry  and  Molecular  Biology,  University  of  Chicago.  Chicago,  IL  60637, 
USA. 

“^A  gift  from  S.Jentsch,  ZMBH.  Universitat  Heidelberg,  69120  Heidelberg,  Germany. 

‘^Dohmen  et  ai  (1990). 

'^A  gift  from  K.Madura.  Department  of  Biochemistry,  UMDNJ-Johnson  Medical  School,  Piscataway,  NJ  08854,  USA. 

^A  gift  from  J. Dohmen.  Heinrich-Heine-Universitat,  Institut  fur  Mikrobiologie,  40225  Diisseldorf,  Germany. 

=A  gift  from  R.Deshaies.  Division  of  Biology,  Caltech,  Pasadena,  CA  91125,  USA. 

^A  gift  from  S.Grigoryev,  Department  of  Biology,  University  of  Massachusetts,  Amherst.  MA  01003,  USA. 


disruption  allele  was  u.sed  to  replace  the  wild-type  CUP9  alleles  of  JD52 
(wild-type)  and  JD55  (ubrl A)  by  homologous  recombination  (Rothstein. 
1991),  generating  strains  CBY19  and  CBY17,  respectively. 

CUP9-expressing  plasmids 

The  plasmid  pCB116.  which  expressed  Cup9p-FLAG  from  the  ?o,a,li 
promoter,  was  constructed  by  subcloning  an  ~1  kb  fi«mHI-£a;Rl 
fragment  containing  the  CUP9-FLAG  ORF  into  the  BamH\-EcoR\  site(s) 
of  p416GALl  (Mumberg  et  ai.  1994).  The  CUP9-£L4G-containing 
fragment  was  constructed  by  PCR  amplification  of  the  CUP9  ORF  of 


plasmid  pCBlll  using  primers  PCBl  (5'-CGCGGATCCGAATAGT- 
TACATTCGAAGATG-3')  and  PCB6  (5'-CCGGAATTCTCATTTA- 
TCATCATCGTCTTTGTAATCATTCATATCAGGGTTGGATAG-3'), 
resulting  in  the  addition  of  the  8-residue  FLAG  epitope.  DYKDDDDK, 
to  the  C-terminus  of  Cup9p.  pCBl  1 1  was  constructed  by  subcloning  the 
-2.8  kb  HindlU-Kpnl  fragment  of  the  CUP9-containing  pSUBl-t  (.see 
above)  into  HindUl-Kpnl-cut  pRS316  (Sikorski  and  Hieter.  1989). 
pCB202  was  constructed  by  subcloning  an  ~1  kb  fragment  containing 
the  PcuP9  promoter  into  f/mdlll-Ba/nHl-cut  pCB201.  [The  -I  kb 
fragment  was  produced  by  PCR  from  pCBlll  using  primers  PCB8 
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{5'-GTGTTAGTAAGCTTGTAAAGGAATGCACGTATT-3')  and  PCB9 
(5'-CCCGCGGATCCGCATGCAACTATTCTCGAAGGTTGT-3').] 
pCB200  {ARS-CEN,  LEU2)  and  pCB20l  (2|i,  LEU2)  were  constructed 
by  replacing  the  517  bp  5caI-£coRI  fragment  of  pBR322  (Ausubel 
et  ai,  1992)  with,  respectively,  the  3822  bp  Sca\-Nae\  fragment  of 
pRS415  (Sikorski  and  Hieter,  1989)  and  the  4650  bp  Sca\~Nae\  fragment 
of  pRS425  (Christianson  et  ai.  1992). 

The  plasmid  pCB209  (2|j.,  LEU2),  which  expressed  CUP9  from  the 
^CUP9  promoter,  was  constructed  by  replacing  the  Sph\~Sal\  fragment 
of  pCB202  with  an  -1  kb  fragment  containing  the  CUP9  ORF  that 
was  produced  by  PCR  from  pCBllI,  using  primers  PCBIO  (5'-CCC- 
GCGGATCCGCATGCGAAGATGAATTATAACTGC-3')  and  PCB12 
( 5  '-CCCGCGCGGTCG  ACCTC  AATTC  ATATC  AGGGTTGG  ATAG-3 ' ). 
pCB210  {ARS-CEN,  LEU2)  that  expressed  Cup9p-FLAG  from  the  Pcup9 
promoter  was  constructed  by  replacing  the  Sphl-Sall  fragment  of  pCB202 
with  an  kb  fragment  containing  the  CUP9-FLAG  ORF.  which  was 
produced  from  pCBlIl  using  primers  PCBIO  (see  above)  and  PCB13 
(5'-CCCGCGCGGTCGACCfCATTTATCATCATCGTCTTTGTA- 
ATCATTCATATCAGGGTTGGATAG-3').  yielding  pCB211.  The  ~2  kb 
//mdlll-5an  fragment  of  pCB2ll  containing  PcuP9  and  the  CUP9- 
FLAG  ORF  was  subcloned  into  pCB200.  yielding  pCB210. 

Plasmid  pCB  120.  expressing  GST-Cup9p-ha2  from  the  P^^^;  promoter, 
was  constructed  by  subcloning  the  --1.6  kb  Xha\~EcoK\  fragment, 
containing  the  GST-CUP9~ha2  ORF.  into  the  Xbal-EcoRl  site(s)  of 
p416GALl.  The  Xha\-EcoR\  fragment  was  produced  by  PCR  amplifica¬ 
tion  of  the  GST-CUP9-hQ2  ORF  of  pGEX-2T-CUP9-ha2.  using  the 
pnmers  PCB3  (5'-CCGGAATTCTCAAGCGTAATCTGGAACATC- 
GTATGGGTAAGCGTAATCTGGAACATCGTATGGGTAATTCATA- 
TCAGGGTTGGATAG-3')  and  PCB5  (5'-TGCTCTAGAACAGT- 
ATTCATGTCCCCTATA-3').  pGEX-2T-CUP9-ha2  was  constructed  by 
subcloning  an  -1  kb  fragment  containing  the  CUP9~ha2  ORF  into  the 
fiamHl-E'coRl  site(s)  of  pGEX-2T  (Pharmacia),  resulting  in  an  in-frame 
fusion  of  the  sequence  encoding  26  kDa  glutathione  5-transferase  (GST) 
domain  of  Saccharomyces  japonicum  (Smith  and  Johnson.  1988)  to  the 
second  codon  of  CUP9.  TTie  Ct/P9-/ia2-containing  fragment  was  pro¬ 
duced  by  PCR  amplification  of  the  CUP9  ORF  of  pCB  1 1 1  using  the 
primers  PCB3  (see  above)  and  PCB4  (5'-CGCGGATCCAATTAT- 
AACTGCGAAATACAAAAC-3').  This  step  added  to  the  C-lerminus  of 
Cup9p  a  sequence  encoding  a  tandem  repeat  of  the  9-residue  sequence 
YPYDVPDYA,  derived  from  hemagglutinin  (ha)  of  influenza  virus. 

Northern  hybridization 

RNA  was  isolated  from  S.cerevisiae  as  described  (Schmitt  et  al.,  1990). 
Electrophoresis  of  the  RNA  samples  was  carried  out  on  a  formaldehyde 
RNA  gel  (Ausubel  et  ai.  1992).  An  -50-pg  RNA  sample  was  loaded 
on  a  \%  agarose  gel  containing  1  x  MOPS  buffer.  0.74%  (v/v)  formalde¬ 
hyde.  1.9  mg/ml  iodoacetamide  and  0.5  p-g/ml  ethidium  bromide. 
Electrophoresis  was  carried  out  in  IX  MOPS  buffer  at  5  V/cm. 
RNA  was  transferred  to  B  rights  tar- PI  us  membrane  (Ambion)  using 
TurboBlotter  (Schleicher  &  Schuell)  and  Ambion  RNA  transfer  buffer. 
RNA  was  crosslinked  to  the  air-dried  membranes  using  254  nm  light 
(Ausubel  et  ai,  1992). 

DNA  probes  were  prepared  by  the  random  priming  method  (Ausubel 
et  ai,  1992)  using  [^^PldCTP  and  a  DNA  labeling  kit  (Pharmacia). 
Hybridization  was  carried  out  for  8-16  h  at  42°C  in  Prehybridization/ 
Hybridization  Solution  (Ambion).  Filters  were  washed  according  to  the 
manufacturer's  protocol  and  subjected  to  autoradiography. 

Gei  shift  assay 

PCR  was  used  to  extend  the  Cup9p  ORF  with  a  sequence  encoding 
Ser-Gly-Gly-Thr-Hisft,  yielding  Cup9p-H5,  and  to  engineer  flanking 
restriction  sites  (Ndel  and  BamHl)  for  insertion  into  pET-1  Ic  (Novagen). 
Cup9p-Hft  was  overexpres.sed  in  E.coli  BL21  (DE3)  (Novagen)  (Ausubel 
et  al..  1992)  and  purified  on  a  3  ml  Ni-NTA  column  (Qiagen),  using  a 
linear  gradient  of  imidazole.  Cup9p-H6  eluted  at  -0.25  M  imidazole 
(-90%  pure  at  this  step);  it  was  dialyzed  at  4°C  against  10%  glycerol. 
0.1  M  KCl,  1  mM  EDTA.  0.5  mM  dilhiothreitol,  20  mM  HEPES. 
pH  7.9.  and  then  snap-frozen  in  multiple  samples  in  liquid  N2,  and 
stored  at  -80”C.  The  proximal  (-1  to  -AAl)  and  distal  (-448  to  -897) 
PTR2  promoter  probes  for  the  gel  shift  assay  were  constructed  by  PCR 
amplification  in  the  presence  of  [a-^^P]dCfp.  and  were  purified  using 
spin  columns  (Qiagen).  The  gel  shift  reactions  (20  pi)  contained 
50  pg/ml  poly-dl  dC  (Pharmacia);  -1.5  pg/ml  (500  c.p.m.)  DNA  probe; 

1  mg/ml  acetylated  serum  albumin  (New  England  Biolabs)  and  either 
O.I.  0.5  or  1  pg/m!  of  Cup9p-H6  in  10%  glycerol.  0.1  M  KCl.  2.5  mM 
MgCN.  1  mM  EDTA.  20  mM  HEPES.  pH  7.9.  The  samples  were 
incubated  for  30  min  at  room  temperature,  then  loaded  onto  a  4% 


polyacrylamide  gel  (40:1,  acrylamide:Z?i>acrylainide)  in  0.5  X  TBE 
(Ausubel  et  al.,  1992),  and  electro phoresed  at  10  V/cm  for  3  h  at  4“C, 
followed  by  autoradiography. 

Puise-chase  analysis  of  Cup9p 

One  hundred  ml  cultures  of  S.cerevisiae  JD52  (UBRI)  and  JD55  {ubrJA) 
carrying  either  pCB210  (expressing  Cup9p-FLAG  from  the  CUP9 
promoter)  or  pCB200  (vector  alone)  were  grown  to  an  A^oo  of  -1  in 
SD(-Leu)  medium.  Cells  (50  A^qo  units  total)  from  each  of  the  four  cul¬ 
tures  were  gently  pelleted  by  centrifugation,  washed  with  5  ml  of  SD(- 
Leu),  pelleted  again,  resuspended  in  2  ml  of  SD(-Leu),  and  incubated 
at  30°C  for  10  min.  Each  sample  was  labeled  for  5  min  with  1.4  mCi 
of  (^^Sjmethionine/cysteine  (EXPRESS.  New  England  Nuclear)  at  30‘’C, 
followed  by  pelleting  in  a  microfuge  for  - 15  s.  The  cells  were  resuspended 
in  2.6  ml  of  SD(~Leu),  5  mM  L-methionine.  5  mM  L-cysteine,  and 
incubated  at  30°C.  Samples  of  0.5  ml  were  withdrawn  during  the 
incubation,  pelleted  and  resuspended  in  0.15  ml  of  0.5  M  NaCl-Lysis 
Buffer  (1%  Triton  X-100,  0.5  M  NaCl,  5  mM  EDTA.  50  mM  Na- 
HEPES,  pH  7.5)  containing  a  mixture  of  protease  inhibitors  (Ghislain 
et  ai,  1996).  Glass  beads  (0.5  mm)  were  added,  and  cells  were  disrupted 
by  vortexing  (six  times,  for  30  s  each,  with  1  min  incubations  on  ice  in 
between),  followed  by  the  adjustment  of  NaCl  concentration  to  0.15  M 
through  the  addition  of  75  mM  NaCl-Lysis  Buffer,  further  vortexing  for 
30  s.  and  centrifugation  at  12  000  g  for  10  min.  The  volumes  of 
supernatants  were  adjusted  to  equalize  the  amounts  of  10%  trichloroacetic 
acid-insoluble  ^^S.  Cup9p-FLAG  was  immunoprecipitated  by  the  addition 
of  20  ^il  of  the  monoclonal  anti-FLAG  M2  antibody  conjugated  to 
agarose  beads  (Kodak).  Suspensions  were  incubated  at  0°C  for  I  h,  with 
rotation,  then  centrifuged  at  12  000  ^  for  30  s,  and  washed  four  times 
with  0.8  ml  of  0.15  M  NaCl-Lysis  Buffer.  The  pellets  were  resuspended 
in  SDS-sample  buffer,  heated  at  i00‘"C  for  3  min,  and  subjected  to  SDS- 
12%  PAGE,  followed  by  autoradiography  and  quantitation  using  a 
Phosphorlmager  (Molecular  Dynamics). 

Pulse-chase  analysis  of  GST-CupOp-hao  was  carried  out  as  described 
by  Bartel  et  ai  (1990).  Approximately  10  A<;,(x)  Limits  of  galactose-induced 
cells  were  labeled  for  5  mm  with  0.3  mCi  of  [■'*^S] EXPRESS  in  400  ^il 
SM-galactose  (-Ura)  at  30°C.  The  cells  were  then  transferred  to 
microfuge  tubes,  pelleted  and  resuspended  in  500  )il  of  SD  (-Ura). 
5  mM  L-methionine,  5  mM  L-cysteine.  Samples  of  0. 1  ml  were  withdrawn 
dunng  the  incubation,  pelleted  and  lysed  as  above.  The  ^^S-labeled  GST- 
Cup9p-ha2  was  purified  using  glutathione-agarose  beads  (Sigma)  which 
had  been  blocked  with  bovine  serum  albumin  (BSA;  10  mg/ml).  Twenty 
p.1  of  glutathione-agarose  beads  were  added  to  each  sample  and  the 
suspensions  were  incubated  at  0°C  for  60  min,  with  rotation,  followed 
by  washes  and  electrophoretic  analyses  as  described  for  Cup9p-FLAG. 

Purification  and  N-terminal  sequencing  of  Cup9p-FLAG 

Four  2-1  cultures  of  JD55  (ubrlA)  carrying  pCB116  that  expressed 
Cup9p-FLAG  from  the  GALI  promoter  were  grown  under  selection  in 
SM-raffinose  to  an  A^qo  of  -'O.S.  followed  by  transfer  to  SM-galactose 
and  incubation  at  30°C  for  3  h.  Longer  induction  times  resulted  in  a 
Cup9p-mediated  cytotoxicity  and  lower  yields  of  Cup9p-FLAG.  The 
cells  (-1 X  10*^)  were  harvested  and  lysed  at  4®C  as  described  by  Burgers 
(1995).  The  extract  was  fractionated  by  precipitation  with  0.4%  Polymin  P 
(Sigma)  and  then  further  by  precipitation  with  48%  saturated  ammonium 
sulfate.  The  pellet  was  dissolved  in  3  ml  of  TBS  buffer  (0.15  M  NaCl, 
50  mM  Tris-HCI,  pH  7.5),  and  passed  through  Sephadex  G-25  in  TBS. 
The  resulting  sample  (8.3  ml,  -70  mg/ml  of  protein)  was  applied  to  a 
column  (1  ml)  of  the  monoclonal  anti-FT.AG  M2  antibody  (Kodak).  The 
column  was  washed  three  limes  with  10  ml  of  TBS,  and  Cup9p-FLAG 
was  eluted  by  the  addition  of  five  1-ml  samples  of  TBS  containing, 
respectively,  50.  100,  100,  200  and  200  jag/mi  of  the  FLAG  peptide 
(Kodak).  Peak  Cup9p-FLAG  fractions  (detected  by  immunoblotting) 
were  concentrated  by  partial  lyophilization,  followed  by  precipitation 
with  methanol  (Wessel  and  Flugge.  1984)  in  the  presence  of  human 
insulin  (Sigma,  0.3  mg/ml)  as  a  carrier.  The  resulting  sample  was 
fractionated  by  SDS-12%  PAGE  and  electroblotted  onto  Pro-BIoi 
membrane  (Perkin-Elmer).  After  a  brief  staining  with  Coomassie.  the 
band  of  the  37  kDa  Cup9p-FLAG  (-15  pmol)  was  excised  from  the 
membrane.  Half  of  the  sample  was  used  to  determine  the  amino  acid 
composition;  the  other  half  was  subjected  to  N-terminal  sequencing  for 
seven  cycles,  using  the  Applied  Biosystems  476A  protein  sequencer  at 
the  Caltech  Microchemistry  Facility. 
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